IMPROVED LYSOSOMAL ENZYMES AND LYSOSOMAL ENZYME ACTIVATORS

FIELD OF INVENTION

The present invention relates to modified lysosomal enzymes and modified lysosomal enzyme 
activators having improved properties, methods of preparing such polypeptides and their use in 
therapy, in particular enzyme replacement therapy for the treatment of lysosomal storage diseases. 

BACKGROUND OF THE INVENTION 

Lysosomes are acidic cytoplasmic organelles present in all animal cells. Lysosomes contain a
variety of hydrolytic enzymes (lysosomal enzymes) that degrade internalized and endogenous
macromolecular substrates, such as sphingolipids, present in the lysosomes. Deficiency of one or
more of such enzymes leads to accumulation of undegraded substrate and eventually onset of a
lysosomal storage disease. More than thirty distinct, inherited lysosomal storage diseases have been
reported, some of which can be treated by presently available enzyme replacement therapy. Such
diseases (and related lysosomal enzymes) include Fabry's disease (α-galactosidase), Farber's
disease (ceramidase), Gaucher disease (glucocerebrosidase), gangliosidosis (β-galactosidase),
Tay-Sachs disease (β-hexosaminidase), Niemann-Pick disease (sphingomyelinase), Schindler disease
(α-N-acetylgalactosaminidase), Hunter syndrome (iduronate-2-sulfatase), Sly syndrome
(β-glucuronidase), Hurler and Hurler/Scheie syndromes (iduronidase), I-cell/Sanfilippo syndrome
(mannose 6-phosphate transporter) and Pompe's disease (α-glucosidase). The diseases and related
enzymes are described in a variety of publications, see e.g. Scriver et al., The metabolic and
molecular bases of inherited disease, volume II part 12, Lysosomal enzymes, pp. 2427-2882, New
York McGraw-Hill 1995, and US 5,929,304. For instance, US 5,580,757 discloses expression of
alpha-galactosidase.

Activators of lysosomal enzymes are known, examples of which are the Saposins. Saposin A (SapA),
Saposin B (SapB), Saposin C (SapC) and Saposin D (SapD) are generated in lysosomes from a common
precursor, called prosaposin, whose proteolytic cleavage begins in the late endosomes (Nakano et
al., J. Biochem. (Tokyo) 105, 152-154, 1989; Gavrieli-Rorman and Grabowski, Genomics 5, 486-492,
1989; Vielhaber et al., J. Biol. Chem. 271, 32438-32446, 1996). All Saposins appear to be involved
in the lysosomal degradation of sphingolipids. A patient lacking all four saposins showed a
combined sphingolipid storage disorder. So far, selective deficiencies of saposins are only known
for SapB and SapC. Mutations affecting the coding region of SapB cause a variant form of
metachromatic leukodystrophy with storage of sulfatides (Schlote et al., Eur. J. Pediatr. 150,
584-591, 1991). This, together with in vitro data, suggests SapB to be an activator of
arylsulfatase A in vivo. SapC is a small 80 amino acid peptide which is an essential co-factor for
the in vivo activity of GCB (Qi et al., J. Biol. Chem. 271, 6874-6880, 1996). SapC has been
proposed to bind to GCB in vivo and induce a conformational change in the enzyme, thereby
maximizing its catalytic activity (Grace et al., 1994, J. Biol. Chem. 269; 2283-2291; Qi &
Grabowski, 1998, Biochemistry 37; 11544-11554). So far the actual physiological function of SapA
and SapD has not been firmly established, but a role for SapA in the degradation of
glucosylceramide and galactosylceramide has been hypothesized, and mouse studies have indicated its
role in activation of galactocerebrosidase (Oral information, VIII International Congress of Inborn
Errors of Metabolism, Cambridge, UK, 13-17 September 2000). SapD has been suggested to be involved
in ceramide hydrolysis (Vaccaro et al., Neurochemical Research, 24, 307-314, 1999). Mouse studies
have indicated that SapD may be an in vivo activator of α-galactosidase (Oral information, VIII
International Congress of Inborn Errors of Metabolism, Cambridge, UK, 13-17 September 2000).

Gaucher's disease is an autosomal recessive disease resulting in a deficiency of the
lysosomal hydrolase acid β-glucosidase, also termed glucocerebrosidase (E.C. 3.2.1.45) or GCB
hereinafter. Gaucher's disease has been classified in three subtypes, cf. the table below.



Clinical Features            Type I                 Type II      Type III

Clinical Onset               Childhood/Adulthood    Infancy      Childhood
Hepatosplenomegaly           +                      +            +
Hematologic Complications    +                      +            +
Skeletal Involvement         +                      -            +
Neurologic Involvement       -                      +            +
Survival                     Variable               <2 yrs       2nd-4th decade
Ethnic predilection          Ashkenazic Jewish      Panethnic    Northern Swedish



There is a wide variability in the pattern and severity of disease involvement between and 
within each subtype. All three variants of Gaucher's disease are inherited "storage" diseases but are 
distinguished by the presence or absence of neurologic complications. The defect causes progressive 
accumulation of undegraded glycolipid substrates, particularly glucosylceramide, in 
reticuloendothelial cells and results in infiltration of the bone marrow, hepatosplenomegaly, and 
skeletal complications. Gaucher's disease is the most common inheritable lysosomal disease and 
occurs with a frequency of 1/40000-1/60000 in Caucasians and 1/1000 in Ashkenazi Jews. 

The only existing treatment is enzyme substitution that has become available in the last 
decade. Initially, enzyme purified from human placentas (Ceredase™) was used, but patients are 
currently being switched to recombinantly produced enzyme, termed Cerezyme™. The enzyme is 
dispensed intravenously (IV) up to three times a week. The treatment appears to be effective in
removing many of the symptoms as well as correcting the paraclinical abnormalities, except the
neurological symptoms seen in types 2 and 3.

GCB is necessary for the breakdown of a particular fatty substance, glucosylceramide, to
glucose and ceramide, by hydrolysis of the O-β-D-glucosidic linkage. It has been shown that the in
vitro activity of the protein is elevated by the presence of acidic lipids, such as
phosphatidylserine, and SapC. The enzyme is a lysosomal membrane protein, but although the enzyme
has substantial hydrophobic properties, no evidence for a transmembrane segment has been found. It
has been shown by fluorescence spectroscopy that the protein binds lipids and enters the membrane
to some degree (Qi & Grabowski, 1998, Biochemistry 37; 11544-11554). It has been suggested that the
role of SapC is to bind to GCB and induce a conformational change in the enzyme, thereby
maximizing the catalytic activity (Qi & Grabowski, 1998, Biochemistry 37; 11544-11554).

The gene encoding human GCB was first sequenced in 1985 (Sorge et al., 1985, Proc. Natl. Acad.
Sci. USA 82; 7289-7293). The protein consists of 497 amino acids derived from a 536-mer
pro-peptide. The enzyme contains 4 glycosylation sites and 22 lysines. The recombinantly produced
enzyme (Cerezyme™) differs from the placental enzyme (Ceredase™) in position 495, where an arginine
has been substituted with a histidine. Furthermore, the oligosaccharide composition differs between
the recombinant and the placental GCB, as the former has more fucose and N-acetyl-glucosamine
residues while the latter retains one high mannose chain. Both types of GCB are treated with three
different glycosidases (neuraminidase, galactosidase, and β-N-acetyl-glucosaminidase) to expose
terminal mannoses, which enables targeting of phagocytic cells. A pharmaceutical preparation
comprising the recombinantly produced enzyme is described in US 5,549,892.

WO 89/05850 discloses a clone of GCB and its expression in invertebrate cells. 

WO 90/07573 discloses a recombinant enzymatically active GCB produced by a eukaryotic
cell such as an insect, yeast or mammalian cell. The enzyme comprises at least one exposed
mannose residue for binding to the mannose receptor of phagocytic cells.

EP 401 362 B1 discloses the production of GCB in CHO cells. The GCB is indicated to
include an oligosaccharide moiety with at least one exposed mannose residue and preferably 2-4 
mannose residues. 

US 5,433,946 discloses lectin-lysosomal enzyme conjugates and their use in treatment of
lysosomal storage diseases. Glucocerebrosidase is mentioned as one enzyme among many to be
modified and used in accordance with the teaching of US 5,433,946.

US 5,929,304 discloses production of lysosomal enzymes, exemplified by GCB, in 
transgenic plant cells. 

US 5,705,153 discloses GCB conjugates with non-antigenic polymers such as polyethylene
glycol. The conjugates are claimed to exhibit enhanced turnover time and prolonged in vivo activity.

A drawback of the previously suggested forms of GCB has been insufficient targeting of GCB to
phagocytic cells. It has been shown that while 50-60% of administered enzyme in mice was taken up
by the liver, only approximately 10% was correctly targeted to liver phagocytic cells (Kupffer
cells) (Bijsterbosch et al., 1996, Eur. J. Biochem. 237; 344-349 and Friedmann et al., 1999, Blood,
93; 2807-2816). This incorrect targeting, combined with a short half-life in serum (minutes) and in
lysosomes (2-12 hours), results in a non-optimal treatment of Gaucher patients.

Doebber et al., J. Biol. Chem., 257, pp. 2193-2199, 1982 report enhanced macrophage
uptake of synthetically glycosylated human placental GCB.

One drawback associated with existing lysosomal enzyme replacement therapy is that the in vivo
bioactivity of the enzyme is undesirably low, e.g. because of low uptake and/or reduced targeting
to lysosomes of the specific cells where the substrate is accumulated, and/or a short functional in
vivo half-life in the lysosomes. Because of the low in vivo bioactivity, frequent injections are
required in current therapy. Accordingly, a need exists for providing lysosomal enzymes with
improved in vivo activity.

SUMMARY OF THE INVENTION 

The object of the present invention is to improve the in vivo bioactivity of lysosomal enzymes and
thereby provide an improved treatment of lysosomal storage diseases. This is achieved by providing
modified lysosomal enzymes and/or modified lysosomal enzyme activators with improved
properties, such as improved uptake in lysosomal cells and improved functional in vivo half-life.

In one aspect the invention relates to a polypeptide selected from the group of lysosomal
enzymes and lysosomal enzyme activators, which polypeptide comprises at least one introduced
glycosylation site as compared to a corresponding, preferably naturally-occurring, parent enzyme or
activator. By introducing additional glycosylation sites, increased and/or specific glycosylation
may be achieved, which is contemplated to lead to an improved uptake in the relevant cells or
organelles and increased functional in vivo half-life (presumably as a consequence of reduced
proteolytic degradation).

In another aspect the invention relates to a chimeric polypeptide comprising a lysosomal
enzyme unit linked to at least one unit of an activator for said enzyme or a targeting polypeptide
capable of targeting phagocytic cells. Thereby, the uptake and in vivo activity are improved as
compared to the lysosomal enzyme in itself.

The invention also provides for a conjugated polypeptide, the polypeptide part of which is
selected from the group of a lysosomal enzyme and a lysosomal enzyme activator and has at least
one introduced and/or at least one removed attachment group for a macromolecular moiety as
compared to a corresponding parent polypeptide, the polypeptide part being conjugated to at least
one macromolecular moiety different from an oligosaccharide moiety. Of particular interest is a
macromolecular moiety that is a polymer molecule such as PEG.

In still further aspects, the invention relates to a nucleotide sequence encoding a polypeptide
of the invention, a vector and host cell comprising said nucleotide sequence, as well as a method of
producing the polypeptide.

In a further aspect, the invention relates to a method of improving at least one property of a
lysosomal enzyme, such as increasing in vivo activity thereof, which method comprises introducing
an additional glycosylation site (or attachment group for a non-oligosaccharide moiety) into the
lysosomal enzyme, preferably at a position exposed at the surface of the protein, and producing the
modified lysosomal enzyme under conditions ensuring that the enzyme is glycosylated (or
conjugated to the non-oligosaccharide moiety).

In still further aspects, the invention relates to a pharmaceutical composition comprising a
polypeptide of the invention and a pharmaceutically acceptable diluent, carrier or excipient and to
the use of the polypeptide for the treatment or prevention of a lysosomal storage disease treatable
by the polypeptide or for the manufacture of a medicament for treatment or prevention of such
disease.

The general principle of the present invention is illustrated herein predominantly by
modification of GCB and accordingly, a specific object is to provide enzymatically active forms of
GCB with increased in vivo activity, in particular with increased targeting to phagocytic cells
and/or increased lysosomal activity. However, it is believed that the concept described herein for
modification of GCB is generally applicable to other lysosomal enzymes.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1: Uptake (dose-response) in J774E cells of selected GCB polypeptides compared to
Cerezyme. Different concentrations (400 mU/ml -1 5mU/ml) of the GCB polypeptides were 
incubated with the cells in absence (closed symbols) or in the presence of yeast mannan (open 
symbols) as described in the Methods section. The amount of GCB polypeptide taken up by the cells
was determined by GCB Activity Assay. A; Raw data. B; Data corrected for mannose baseline.

Figure 2. Stability of selected GCB polypeptides in J774E cells compared to Cerezyme™.
Briefly, cells were incubated with 40 mU/ml enzyme for 1 hr before washing the cells and then
measuring the amount of enzyme left in the cells after 30 min, 1 hr, 2 hr, 3 hr, 4 hr, and 5 hr using
the GCB Activity Assay.

Figure 3. Activation of GCB polypeptides and Cerezyme™ in response to increasing
amounts of phosphatidyl serine from bovine brain using the assay described in Methods.

Figure 4. Activation of GCB polypeptides and Cerezyme™ in response to increasing
amounts of SapC. The assay was done at pH 4.7 and in the presence of 5 µg/ml phosphatidyl serine
and increasing amounts of SapC. For details, see Methods. A; Raw data curves and B; normalized
curves.

Figure 5: A schematic drawing showing the principle of random introduction of
glycosylation sites (as further described in Example 2).

Figure 6: SDS-PAGE of PEGylated wtGCB. Mark 12™ is a Mw marker, available from
Novex, San Diego, CA. 5X, 20X and 120X, respectively, indicate a 5, 20 and 120 times molar
excess of PEG relative to the number of lysine residues.

Figure 7: Uptake in J774E cells of PEGylated wtGCB.

Figure 8: Preferred oligosaccharide structures.

DETAILED DISCLOSURE OF THE INVENTION

Definitions

In the present context, the term "polypeptide" is intended to indicate any structural form (e.g. the
primary, secondary or tertiary structure) of an amino acid sequence comprising more than 5 amino
acid residues. Thus, the term is intended to include the folded form of the polypeptide, otherwise
termed "protein". The term polypeptide is used herein about any polypeptide of the invention in any
form, whether a chimeric polypeptide or a polypeptide comprising a peptide addition. The "GCB
polypeptide" is a polypeptide exhibiting GCB activity, i.e. a polypeptide which is capable of
degrading a glycolipid substrate, in particular 4-MU-glucopyranoside or
p-nitrophenyl-glucopyranoside as described in the Methods section hereinafter. Typically the GCB
polypeptide comprises more than 100 amino acid residues such as more than 300 amino acid residues,
e.g. 100-500 amino acid residues. A "SapC polypeptide" is a polypeptide exhibiting SapC activity,
i.e. capability of activating a GCB polypeptide, e.g. demonstrated by use of the SapC activation
assay of GCB described in the Methods section herein. Analogously, a "SapA polypeptide" is a
polypeptide exhibiting SapA activity, a "SapB polypeptide" is a polypeptide exhibiting SapB
activity and a "SapD polypeptide" is a polypeptide exhibiting SapD activity, such activities being
determined by methods known in the art. Furthermore, the "polypeptide" may be derivatized and thus
be in the form of a "conjugated polypeptide" comprising a macromolecular moiety.

The term "conjugated polypeptide" is intended to indicate a heterogeneous (in the sense of
composite) molecule formed by the covalent attachment of one or more polypeptide(s) to one or
more macromolecular moieties such as polymer molecules or oligosaccharide moieties. The term
covalent attachment means that the polypeptide and the macromolecular moiety are either directly
covalently joined to one another, or else are indirectly covalently joined to one another through an
intervening moiety or moieties, such as a bridge, spacer, or linkage moiety or moieties. Preferably,
the conjugated polypeptide is soluble at relevant concentrations and conditions, i.e. soluble in
physiological fluids such as blood. The term "non-conjugated polypeptide" may be used about the
polypeptide part of the conjugate. A glycosylated polypeptide constitutes one example of a
conjugated polypeptide as used herein. Another example is a PEGylated polypeptide.

The term "wildtype" or "wt" is used about any naturally-occurring lysosomal enzyme or
lysosomal enzyme activator, whether isolated from its natural source or produced recombinantly
(in the latter case the wt polypeptide has the amino acid sequence of the corresponding polypeptide
isolated from its natural source). Thus, the term is used about any naturally-occurring human or
other (e.g. primate or murine) lysosomal enzyme or activator, including allelic or other
naturally-occurring variants or functional fragments exhibiting the relevant lysosomal enzyme or
activator activity, preferably at least 25% of the activity of the corresponding wt enzyme or
activator.

In the case of GCB it is well known that numerous naturally-occurring GCBs exist which 
differ from each other in one or more amino acid residues and the term "wtGCB" is intended to 
mean any such naturally-occurring GCB. For instance, the wtGCB is an endogenous enzyme 
purified from human cells, in particular human placenta, or an enzyme produced recombinantly on 
the basis of a gene or cDNA sequence encoding such naturally-occurring GCB. Specific examples
of "wtGCB" cDNA sequences (as defined in the present context) are those described by Sorge et al.,
Proc. Natl. Acad. Sci. USA 82, 7289-7293, 1985 and in US 5,879,680, the amino acid sequences of
which are comprised in SEQ ID NO 1.

The term "parent" is used about the starting polypeptide to be modified in accordance with
the invention. The parent polypeptide may be a wt polypeptide or a variant or functional fragment
thereof. Typically, a "variant" shows at least 80% sequence identity with the amino acid sequence
of the relevant wt polypeptide, in particular at least 90% identity, such as at least 95%
identity. For instance, a GCB polypeptide variant shows at least 80% sequence identity with the
amino acid sequence shown in SEQ ID NO 1, in particular at least 90% identity, such as at least
95% identity with said sequence. The sequence identity is calculated from the optimal
alignment of the relevant sequences using a suitable program (e.g. CLUSTAL W). A "functional
fragment" of a full-length wt or variant polypeptide is typically deleted in one or more amino acid
residues of the N- and/or C-terminal end, while retaining the qualitative activity of the full-length
polypeptide. For instance, a functional fragment of a full-length GCB polypeptide comprises, e.g., at
least 100 amino acid residues, such as 250-490 amino acid residues, and has GCB activity,
preferably at least 25% of the GCB activity of the corresponding full-length GCB polypeptide. A
functional fragment of a lysosomal enzyme comprises at least the catalytic site of the enzyme.
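
By way of illustration only, the identity thresholds stated above may be checked as sketched below
once an alignment has been produced by a suitable program. The function names, the use of "-" as
the gap character and the choice of counting only columns in which neither sequence has a gap are
assumptions made for this sketch; the present text does not prescribe a particular counting
convention, and the short sequences in the usage note are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Both arguments are rows of an existing pairwise alignment (equal length,
    '-' marks a gap). The denominator convention is an assumption of this sketch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned residue pairs to compare")
    return 100.0 * matches / compared


def is_variant(aligned_parent: str, aligned_candidate: str,
               threshold: float = 80.0) -> bool:
    """True if the candidate meets the identity threshold (80% by default)."""
    return percent_identity(aligned_parent, aligned_candidate) >= threshold
```

For example, two hypothetical ten-residue rows differing in one position give 90% identity, which
satisfies the 80% and 90% thresholds but not the 95% threshold.
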

The term "increased in vivo activity" is defined as 1) increased or prolonged activity in
patients such that a lower dosage and/or less frequent infusions lead to equal or better treatment
efficacy as compared to that obtained by the unmodified enzyme or by conventional GCB or other
lysosomal enzyme therapy, 2) increased or prolonged activity in mononuclear cells, more preferably
in the isolated lysosomes, harvested from patients treated with a polypeptide of the invention as
compared to that obtained by a reference molecule, 3) increased or prolonged activity in phagocytic
cells, e.g. Kupffer cells or peritoneal macrophages, isolated from mice pre-treated with a
polypeptide of the invention as compared to that obtained by a reference molecule, 4) increased or
prolonged activity in macrophage-like cell lines, more preferably in isolated lysosomes therefrom,
after exposure to a polypeptide of the invention (essentially as described below in the
experimental section) as compared to that obtained by a reference molecule, 5) improved uptake of
the polypeptide in the lysosomes of phagocytic cells, e.g. macrophage-like cells, as compared to a
reference molecule, 6) increased half-life of the polypeptide in the lysosomes as compared to that of
a reference molecule, and/or 7) increased stability in serum and/or in phagocytic cells/lysosomes,
e.g. seen as decreased sensitivity to proteolytic degradation, increased half-life and the like, as
compared to a reference molecule.

The "reference molecule" is normally the parent polypeptide or an available commercial 
product comprising the parent polypeptide. For instance, in the case of a GCB polypeptide, the 
reference molecule is typically Cerezyme™ or Ceredase™ or a recombinantly produced wtGCB, 
e.g. the enzyme resulting from expression of the cDNA sequence shown in US 5,879,680 in an Sf9
insect cell (e.g. as described in Example 1 hereinafter).

Increased or prolonged activity as used above is conveniently measured in terms of 
increased functional in vivo half-life. The term "functional in vivo half-life" is used in its normal 
meaning, i.e. the time in which 50% of the enzyme activity of the polypeptide is retained under in 
vivo conditions, e.g. under the conditions mentioned above. Preferably, the term is applied to the 
enzyme activity in macrophage-like cells isolated from patients or animals treated with the enzyme
or in lysosomes isolated from these cells. 
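
As a non-limiting sketch, a functional half-life may be estimated from a time course of residual
enzyme activity as shown below. The sketch assumes that the activity decays according to
first-order kinetics, A(t) = A0 * exp(-k*t), so that the half-life equals ln(2)/k; this kinetic
model, the function name and the example values are assumptions of the sketch and are not taken
from the present text.

```python
import math
from typing import Sequence


def functional_half_life(times: Sequence[float],
                         activities: Sequence[float]) -> float:
    """Estimate the half-life from (time, activity) pairs.

    Assumes first-order decay and fits ln(activity) against time by ordinary
    least squares; the half-life is ln(2) divided by the decay rate constant.
    The result is expressed in the same time unit as `times`.
    """
    if len(times) != len(activities) or len(times) < 2:
        raise ValueError("need at least two matching time/activity points")
    if any(a <= 0 for a in activities):
        raise ValueError("activities must be positive to take logarithms")
    logs = [math.log(a) for a in activities]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    slope = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs)) / sxx
    if slope >= 0:
        raise ValueError("activity does not decay over the time course")
    return math.log(2) / -slope
```

Hypothetical activities that halve every 2 hours yield an estimated half-life of about 2 hours.
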

The term "increased" as used about the in vivo activity, or the serum or the functional in
vivo half-life, is used to indicate that the relevant activity or half-life of the polypeptide is
statistically significantly increased relative to that of a reference molecule. Preferably, the
increased in vivo activity (i.e. any of the specific properties listed above or any combination of
two or more of such properties) of a polypeptide of the invention is at least 110% of that of a
reference molecule (e.g. the unmodified enzyme), in particular at least 120%, such as at least 130%
or 140%, when measured under comparable conditions. Even more preferably, the increased in vivo
activity is at least 150%, such as at least 160% or at least 170% or at least 200% of that of a
reference molecule (e.g. the unmodified enzyme). For instance, the functional in vivo half-life is
at least 10% higher, such as at least 50% higher, preferably at least 100% higher than that of a wt
parent polypeptide, e.g. wtGCB.

The term "immunogenicity" as used in connection with a polypeptide of the invention is
intended to indicate the ability of the polypeptide to induce a response from the immune system. The
immune response may be a cell or antibody mediated response (see, e.g., Roitt: Essential
Immunology (8th Edition, Blackwell) for further definition of immunogenicity). Normally, reduced
antibody reactivity will be an indication of reduced immunogenicity.

The term "reducing the immunogenicity" is intended to indicate that the polypeptide of the
invention gives rise to a measurably lower immune response than a reference molecule as
determined under comparable conditions. The reduced immunogenicity may be determined by use
of any suitable method known in the art, e.g. in vivo or in vitro.

The term "attachment group" is intended to indicate a functional group of an amino acid
residue capable of attaching a macromolecular moiety such as a polymer molecule, an
oligosaccharide moiety, a lipophilic molecule or an organic derivatizing agent. Useful attachment
groups and their matching macromolecular moieties are apparent from the table below.



Attachment  Amino acid            Examples of               Conjugation method/       Reference
group                             macromolecular moiety     Activated PEG

-NH2        N-terminal, Lys       Polymer, e.g. PEG         mPEG-SPA                  Shearwater Inc.
                                                            Tresylated mPEG           Delgado et al., Critical
                                                                                      Reviews in Therapeutic Drug
                                                                                      Carrier Systems
                                                                                      9(3,4):249-304 (1992)

-COOH       C-term, Asp, Glu      Polymer, e.g. PEG         mPEG-Hz                   Shearwater Inc.
                                  (Oligosaccharide moiety)  (In vitro glycosylation)

-SH         Cys                   Polymer, e.g. PEG         PEG-vinylsulphone         Shearwater Inc.
                                                            PEG-maleimide             Delgado et al., Critical
                                  Oligosaccharide moiety    In vitro glycosylation    Reviews in Therapeutic Drug
                                                                                      Carrier Systems
                                                                                      9(3,4):249-304 (1992)

-OH         Ser, Thr, OH-, Lys    Oligosaccharide moiety    In vivo O-linked
                                                            glycosylation

-CONH2      Asn as part of an     Oligosaccharide moiety    In vivo N-glycosylation
            N-glycosylation site  Polymer, e.g. PEG

Aromatic    Phe, Tyr, Trp         Oligosaccharide moiety    In vitro glycosylation
residue

-CONH2      Gln                   Oligosaccharide moiety    In vitro glycosylation    Yan and Wold,
                                                                                      Biochemistry, 1984,

For in vivo N-glycosylation, the term "attachment group" is used in an unconventional way
to indicate the amino acid residues constituting an N-glycosylation site (with the sequence
N-X'-S/T/C-X", wherein X' is any amino acid residue except proline, X" is any amino acid residue
that may or may not be identical to X' and preferably is different from proline, N is asparagine and
S/T/C is either serine, threonine or cysteine, preferably serine or threonine, and most preferably
threonine). Although the asparagine residue of the N-glycosylation site is the one to which the
oligosaccharide moiety is attached during in vivo glycosylation, such attachment cannot be achieved
unless the other amino acid residues of the N-glycosylation site are present. Accordingly, when the
macromolecular moiety is an oligosaccharide moiety and the conjugation is to be achieved by
N-glycosylation, the term "amino acid residue comprising an attachment group for the macromolecular
moiety" as used in connection with alterations of the amino acid sequence of the parent GCB is to
be understood to mean that one or more amino acid residues constituting an N-glycosylation site are
to be altered in such a manner that a functional N-glycosylation site is either introduced into the
amino acid sequence or removed from said sequence. Normally, the term "glycosylation site" is used
herein about an attachment group for an oligosaccharide moiety.
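
The N-glycosylation site definition given above (N-X'-S/T/C-X") can be expressed as a simple
pattern search, sketched below for illustration only. The function name, the optional `strict_x2`
flag (which applies the stated preference that X" be different from proline) and the example
sequence are assumptions of this sketch; the example sequence is hypothetical and is not SEQ ID
NO 1.

```python
import re
from typing import List


def find_n_glycosylation_sites(seq: str, strict_x2: bool = False) -> List[int]:
    """Return 1-based positions of Asn residues within N-X'-S/T/C-X" motifs.

    X' is any residue except proline. X" is any residue, or any residue except
    proline when strict_x2 is True. A lookahead is used so that overlapping
    motifs are all reported.
    """
    x2 = "[^P]" if strict_x2 else "."
    pattern = re.compile(rf"(?=N[^P][STC]{x2})")
    return [match.start() + 1 for match in pattern.finditer(seq.upper())]
```

For the hypothetical sequence MNASKNPTANCTPNGT, the search reports positions 2 and 10; position 6
is rejected because X' is proline, and position 14 is rejected because no X" residue follows.
With `strict_x2=True`, position 10 is also rejected because its X" residue is proline.
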

The term "macromolecular moiety" (which may also be termed non-peptide moiety) is
intended to indicate any molecule, different from a peptide polymer composed of amino acid
monomers linked together by peptide bonds, which molecule is capable of conjugating to an
attachment group of the polypeptide of the invention. Examples of such molecules include
oligosaccharides (attached by in vivo or in vitro glycosylation) and polymers (as further described in
the section entitled "Conjugation to a non-oligosaccharide macromolecular moiety"). The term
"polymer molecule" may be used interchangeably with "polymeric group". Except where the
number of macromolecular moieties, such as polymeric groups, in the conjugate is expressly
indicated, every reference to a macromolecular moiety referred to herein is intended as a reference
to one or more such moieties of the conjugate.

The term "introduce" used in relation to an amino acid residue comprising an attachment
group for a macromolecular moiety, e.g. a glycosylation site, is primarily intended to mean
substitution of one or more existing amino acid residues, but may also mean insertion or deletion of
an additional amino acid residue. The term "remove" is primarily intended to mean substitution of
the amino acid residue(s) to be removed with (an)other amino acid residue(s), but may also mean
deletion (without substitution) of the amino acid residue to be removed.

In the present application, amino acid names and atom names (e.g. CA, CB, NZ, N, O, C,
etc.) are used as defined by the Protein DataBank (PDB), which are based on the IUPAC
nomenclature (IUPAC Nomenclature and Symbolism for Amino Acids and Peptides (residue names,
atom names etc.), Eur. J. Biochem., 138, 9-37 (1984), together with their corrections in Eur. J.
Biochem., 152, 1 (1985)). The term "amino acid residue" is intended to indicate an amino acid
residue contained in the group consisting of alanine (Ala or A), cysteine (Cys or C), aspartic acid
(Asp or D), glutamic acid (Glu or E), phenylalanine (Phe or F), glycine (Gly or G), histidine (His
or H), isoleucine (Ile or I), lysine (Lys or K), leucine (Leu or L), methionine (Met or M),
asparagine (Asn or N), proline (Pro or P), glutamine (Gln or Q), arginine (Arg or R), serine (Ser
or S), threonine (Thr or T), valine (Val or V), tryptophan (Trp or W), and tyrosine (Tyr or Y)
residues. The terminology used for identifying amino acid positions/substitutions is illustrated as
follows: K7 (indicates position #7 occupied by a lysine residue in the amino acid sequence shown in
SEQ ID NO 1). K7N (indicates that the lysine residue of position 7 has been replaced with an
asparagine). The numbering of amino acid residues made herein is made relative to the amino acid
sequence shown in SEQ ID NO 1. Multiple substitutions are indicated with a "+", e.g. K7N+F9T means
an amino acid sequence which comprises a substitution of the lysine residue in position 7 with an
asparagine and a substitution of the phenylalanine residue in position 9 with a threonine residue.
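
The substitution terminology described above may, purely as an illustrative sketch, be parsed and
applied to a sequence as follows. The function name and the ten-residue example sequence are
hypothetical (the example is not SEQ ID NO 1); the sketch merely assumes one-letter codes,
1-based numbering and "+" as the separator, as stated above.

```python
import re

_AMINO = "ACDEFGHIKLMNPQRSTVWY"
# One substitution token, e.g. "K7N": original residue, 1-based position, new residue.
_SUBSTITUTION = re.compile(rf"^([{_AMINO}])(\d+)([{_AMINO}])$")


def apply_substitutions(seq: str, spec: str) -> str:
    """Apply substitutions written as e.g. 'K7N+F9T' to a one-letter sequence.

    Raises ValueError if a token cannot be parsed, if a position lies outside
    the sequence, or if the stated original residue is not found there.
    """
    residues = list(seq.upper())
    for token in spec.split("+"):
        match = _SUBSTITUTION.match(token.strip().upper())
        if match is None:
            raise ValueError(f"cannot parse substitution {token!r}")
        old, position, new = match.group(1), int(match.group(2)), match.group(3)
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        if residues[position - 1] != old:
            raise ValueError(
                f"position {position} holds {residues[position - 1]}, not {old}"
            )
        residues[position - 1] = new
    return "".join(residues)
```

Applied to the hypothetical sequence ACDEFGKLFS, the specification K7N+F9T gives ACDEFGNLTS.
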

The polypeptide of the invention 

Introduction of glycosylation site(s) 

One important modification of lysosomal enzymes and lysosomal enzyme activators described
herein is related to changing the glycosylation profile of the enzymes and activators, with respect to
the number of attached oligosaccharide moieties, and/or the composition of the oligosaccharide
moieties. In particular, the invention is focused on providing a modified lysosomal enzyme or
lysosomal enzyme activator with an increased number of high-mannose oligosaccharide moieties as
compared to the corresponding parent, e.g., wt enzyme or activator.

Conveniently, the glycosylation profile of the lysosomal enzyme or lysosomal enzyme 
activator is altered by introducing and/or removing glycosylation sites in the amino acid sequence of 
the enzyme or activator, and producing the modified enzyme or activator under conditions providing 
for the desired glycosylation. The glycosylation is described further below in the section entitled 
"Glycosylation". 

In a first aspect the polypeptide of the invention is selected from the group of lysosomal 
enzymes and lysosomal enzyme activators comprising at least one introduced glycosylation site as 
compared to a corresponding parent, preferably naturally-occurring, enzyme or activator. In other 
words, the polypeptide of the invention has an amino acid sequence that differs from that of a parent 
polypeptide in that it comprises at least one introduced glycosylation site. 

i) Introduction of glycosylation site in mature sequence 

In one embodiment the glycosylation site(s) is introduced into the amino acid sequence of 
the mature form of the parent lysosomal enzyme or activator. For instance, for modification of GCB 
the glycosylation site is introduced within the amino acid sequence shown in SEQ ID NO 1. For 
instance, for modification of SapC, the glycosylation site is introduced within the amino acid 
sequence shown in SEQ ID NO 3. 

The type of glycosylation site to be introduced is selected so as to provide the desired 
glycosylation profile. 

The glycosylation site may be an in vitro or in vivo glycosylation site. For instance, the in 
vitro glycosylation site is selected from the group consisting of the N-terminal amino acid residue of 
the polypeptide, the C-terminal residue of the polypeptide, lysine, cysteine, arginine, glutamine, 
aspartic acid, glutamic acid, serine, tyrosine, histidine, phenylalanine and tryptophan, i.e. any of the 
attachment groups apparent from the table above in the definitions section. Of particular interest is 
an in vitro glycosylation site that is an epsilon-amino group, in particular as part of a lysine residue. 
Preferably, the glycosylation site is an in vivo glycosylation site. The introduction of an in vivo 
glycosylation site is normally performed by insertion, deletion or substitution of one or more amino 
acid residues that are selected so that a functional N- or O-glycosylation site is introduced into the 
amino acid sequence. Preferably, the amino acid residue(s) are inserted or substituted so that the 
resulting glycosylation site is located on the surface of the protein. For instance, it is desirable that 
the N-residue of an N-glycosylation site or the S or T residue of an O-glycosylation site is located at 
the surface of the polypeptide. Since charged amino acids are normally located on the surface of the 
protein, at least one of the amino acid residues to be modified in order to introduce a glycosylation 
site is preferably a charged amino acid residue or an amino acid residue located between position -4 
and +4 relative to a charged amino acid residue (i.e. up to four amino acid residues located towards 
the N-terminal of the polypeptide relative to the charged amino acid residue, or up to four amino acids 
located towards the C-terminal of the polypeptide relative to the charged amino acid residue). Such 
residue is preferably selected from the group consisting of E, D, R, K, and H, and is most preferably 
K. It is understood that one or more of the amino acid residues located between position -4 and +4 
relative to a charged amino acid residue may be modified in order to generate an in vivo (N- or O-) 
glycosylation site or an in vitro glycosylation site. 
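
As a rough illustration of the -4 to +4 criterion, the following Python sketch lists the positions of a sequence that are charged residues or lie within four residues of one. The function name and the toy sequence are hypothetical, and the sketch works on primary sequence only; it does not confirm surface exposure, which would require a structure or model.

```python
CHARGED = set("EDRKH")  # the charged residues named in the text


def positions_near_charged(sequence, window=4):
    """Return sorted 1-based positions within `window` residues of a charged residue."""
    hits = set()
    for index, residue in enumerate(sequence):
        if residue in CHARGED:
            start = max(0, index - window)
            end = min(len(sequence), index + window + 1)
            # Convert the 0-based half-open slice to 1-based positions.
            hits.update(range(start + 1, end + 1))
    return sorted(hits)


# Hypothetical 16-residue sequence with a single lysine in position 8.
toy = "AAAAAAAKAAAAAAAA"
print(positions_near_charged(toy))
```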

Furthermore, in order to ensure efficient glycosylation it is preferred that the in vivo 
glycosylation site, in particular the N residue of the N-glycosylation site or the S or T residue of the 
O-glycosylation site, is located in the N-terminal part of the lysosomal enzyme or activator, 
preferably in the part which precedes (and thus is outside) the last 50 C-terminal residues of the 
polypeptide. It is also preferred to introduce the in vivo glycosylation site in a position wherein 
only one mutation is required to create the site (i.e. where any other amino acid residues required for 
creating a functional glycosylation site are already present in the polypeptide). Further 
considerations as to the choice of position for introduction of an additional glycosylation site include 
that the amino acid residue to be introduced is not conserved in amino acid sequences homologous 
to the wt lysosomal enzyme or activator and/or is not found in the relevant position of the mutated 
lysosomal enzyme of any lysosomal storage disease patient. 
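
The single-mutation criterion can be sketched as follows in Python, assuming the usual N-X-S/T consensus with X different from proline. The function name, the toy sequence and the choice to propose T (rather than S) as the introduced hydroxy residue are illustrative assumptions; only the two simplest cases are covered (creating the N, or creating the T two positions after an existing N), and conservation and patient-mutation data are not considered.

```python
def single_mutation_sequon_sites(sequence, c_terminal_exclusion=50):
    """List (N position, substitution) pairs where one substitution creates N-X-S/T.

    Positions are 1-based. Candidate sites whose N residue would fall within the
    last `c_terminal_exclusion` residues are skipped.
    """
    sites = []
    last_allowed = len(sequence) - c_terminal_exclusion
    for i in range(len(sequence) - 2):
        n_pos = i + 1
        if n_pos > last_allowed:
            break
        first, middle, third = sequence[i], sequence[i + 1], sequence[i + 2]
        if middle == "P":
            continue  # proline in the middle position blocks the sequon
        if first != "N" and third in "ST":
            sites.append((n_pos, f"{first}{n_pos}N"))
        elif first == "N" and third not in "ST":
            sites.append((n_pos, f"{third}{n_pos + 2}T"))
    return sites


# Hypothetical 12-residue sequence; the exclusion is set to 0 because it is so short.
toy = "AKASGNGAPKPT"
print(single_mutation_sequon_sites(toy, c_terminal_exclusion=0))
```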

In order to increase the likelihood of the polypeptide being O-glycosylated it may be 
advantageous to introduce appropriate O-glycosylation sites into the polypeptide sequence. The 
peptide signal sequence for protein O-glycosylation is not fully characterized, although an in vitro 
study proposed that the sequence motif, XTPXP, serves as a signal for mucin-type O-glycosylation. 
Asada et al., Glycoconj J 16(7):321-326, 1999, showed that the AATPAP sequence acts as an 
efficient O-glycosylation signal in vivo in CHO cells. In yeast cells O-glycosylation of serine and 
threonine residues has been reported in many cases but with no clear consensus sequence for 
O-glycosylation. In one case a serine residue was O-glycosylated by inserting eight amino acid 
residues (TGRGDSPA) into lysozyme (Yamada et al., Biochemistry 33(13), 3885-3889, 1994). Newly 
introduced O-glycosylation sites may therefore also be chosen from these sequences. Furthermore, 
such sites can be constituted by serine and/or threonine rich regions, i.e. amino acid regions 
comprising at least two serine and/or threonine residues in a stretch of 10 amino acid residues, in 
particular at least three, four, five or six such residues in a stretch of 10 amino acid residues, or at 
least two such residues in a stretch of 8, 6 or 4 amino acid residues. The O-glycosylation site is 
preferably introduced by substitution of one or more amino acid residues located in position -5 to 
+5, such as -4 to +4 of any of the N-residues listed above in connection with introduction of 
N-glycosylation sites. 

The in vivo glycosylation site is preferably an N-glycosylation site. N-glycosylation is a 
convenient way of achieving glycosylation, provides a desirable glycosylation profile when 
expressed in certain host cells, and is believed not to give rise to profound immunogenicity 
problems. 

The polypeptide of the invention may comprise at least one introduced glycosylation site 
within the mature sequence, in particular 1-5 introduced glycosylation sites. 

ii) Introduction of glycosylation site by means of peptide addition 

Furthermore, in addition to or as an alternative to introducing glycosylation site(s) within 
the amino acid sequence of the mature lysosomal enzyme or lysosomal enzyme activator, additional 
glycosylation site(s) may be introduced by means of a peptide addition. In this case the polypeptide 
comprises, consists of, or consists essentially of the primary structure 
NH2-X-P-COOH or NH2-P-X-COOH, 
wherein 
X is a peptide addition comprising or contributing to a glycosylation site, and P is the polypeptide to 
be modified, i.e. a lysosomal enzyme or activator thereof, e.g. a parent polypeptide as defined herein 
or a modified polypeptide having introduced and/or removed glycosylation sites in the mature part 
of the polypeptide. 

In the context of a peptide addition the term "comprising a glycosylation site" is intended to 
mean that a complete glycosylation site is present in the peptide addition, whereas the term 
"contributing to a glycosylation site" is intended to cover the situation wherein at least one amino 
acid residue of an N-glycosylation site is present in the peptide addition and the other amino 
acid residue(s) of said site are present in the polypeptide P, whereby the glycosylation site can be 
considered to bridge the peptide addition and the polypeptide. 

Usually, the peptide addition is fused to the N-terminal or C-terminal end of the polypeptide 
P as reflected in the above shown structure so as to provide an N- or C-terminal elongation of the 
polypeptide P. However, it is also possible to insert the peptide addition within the amino acid 
sequence of the polypeptide P, whereby the polypeptide comprises, consists of, or consists essentially of 
the primary structure NH2-Px-X-Py-COOH, wherein 
Px is an N-terminal part of the relevant polypeptide P, 
Py is a C-terminal part of said polypeptide P, and 
X is a peptide addition comprising or contributing to a glycosylation site. 

In order to minimize structural changes effected by the insertion of the peptide addition 
within the sequence of the polypeptide P, it is desirable that it be inserted in a non-structural part 
thereof. For instance, Px is a non-structural N-terminal part of a mature polypeptide P, and Py is a 
structural C-terminal part of said mature polypeptide, or Px is a structural N-terminal part of a 
mature polypeptide P, and Py is a non-structural C-terminal part of said mature polypeptide. 

The term "non-structural part" is intended to indicate a part of either the C- or N-terminal 
end of the folded polypeptide (e.g. protein) that is outside the first structural element, such as an 
α-helix or a β-sheet structure. The non-structural part can easily be identified in a three-dimensional 
structure or model of the polypeptide. If no structure or model is available, a non-structural part 
typically comprises or consists of the first or last 1-20 amino acid residues, such as 1-10 amino acid 
residues of the amino acid sequence constituting the mature form of the polypeptide. 

When the peptide addition comprises only few amino acid residues, e.g. 1-5 such as 1-3 
amino acid residues, and in particular 1 amino acid residue, the peptide addition can be inserted into 
a loop structure of the polypeptide P and thereby elongate said loop. 

In principle the peptide addition X can be any stretch of amino acid residues ranging from a 
single amino acid residue to a mature protein. Usually, the peptide addition X comprises 1-500 
amino acid residues, such as 2-500, normally 2-50 or 3-50 amino acid residues, such as 3-20 amino 
acid residues. The length of the peptide addition to be used for modification of the polypeptide P 
depends on a number of factors including the type of polypeptide to 
be modified and the desired effect to be achieved by the modification. The peptide addition may be 
designed by a site-specific or random approach, e.g. as outlined in further detail in the "Other 
Methods of the Invention" section below and as exemplified in the Examples section herein. 

Typically, the peptide addition X comprises 1-20, such as 1-10 glycosylation sites. For 
instance, the peptide addition X comprises 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 glycosylation sites. It is well 
known that one frequently occurring consequence of modifying an amino acid sequence of, e.g., a 
human protein is that new epitopes are created by such modification. Macromolecular moieties may 
be used to shield any new epitopes created by the peptide addition, and therefore it is desirable 
that sufficient glycosylation sites (or attachment groups for any other desirable macromolecular 
moiety) are present to enable shielding of all epitopes introduced into the sequence. This is e.g. 
achieved when the peptide addition X comprises at least one glycosylation site within a stretch of 30 
contiguous amino acid residues, such as at least one glycosylation site within 20 amino acid 
residues or at least one attachment group within 10 amino acid residues, in particular 1-3 attachment 
groups within a stretch of 10 contiguous amino acid residues in the peptide addition X. 

Thus, in one embodiment the peptide addition X comprises at least two glycosylation sites, 
wherein two of said sites are separated by at most 10 amino acid residues, none of 
which comprises a glycosylation site. 
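
The spacing criterion (at least one glycosylation site within each stretch of 30 contiguous residues) can be checked mechanically for N-glycosylation sites. The Python sketch below is an illustration under stated assumptions: it recognises only the N-X-S/T consensus (X different from proline), it locates each site by its N residue, and the function names are hypothetical.

```python
import re

# Lookahead so that overlapping sequons are all reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def sequon_positions(peptide):
    """Return the 1-based positions of the N residue of each N-X-S/T sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(peptide)]


def every_window_has_site(peptide, window=30):
    """True if each stretch of `window` contiguous residues contains a sequon N."""
    sites = set(sequon_positions(peptide))
    if len(peptide) <= window:
        return bool(sites)
    for start in range(1, len(peptide) - window + 2):
        if not any(start <= site < start + window for site in sites):
            return False
    return True


print(sequon_positions("ANITANITANI"))
print(every_window_has_site("ANITANITANI"))
```

The same function can be called with `window=20` or `window=10` to test the tighter spacings mentioned above.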

Preferably, the glycosylation site of the peptide addition is an in vivo glycosylation site, 
preferably an N-glycosylation site. Accordingly, the peptide addition X comprises at least one 
N-glycosylation site, typically at least two N-glycosylation sites. For instance, the peptide addition X 
has the structure X1-N-X2-T/S/C-Z, wherein X1 is a peptide comprising at least one amino acid 
residue or is absent, X2 is any amino acid residue different from P, and Z is absent or a peptide 
comprising at least one amino acid residue. For instance, X1 is absent, X2 is an amino acid residue 
selected from the group consisting of I, A, G, V and S (all relatively small amino acid residues), and 
Z comprises at least 1 amino acid residue. For instance, Z can be a peptide comprising 1-50 amino 
acid residues and, e.g., 1-10 glycosylation sites. 

Alternatively, X1 comprises at least one amino acid residue, e.g. 1-50 amino acid residues, 
X2 is an amino acid residue selected from the group consisting of I, A, G, V and S, and Z is absent. 
For instance, X1 comprises 1-10 glycosylation sites. 

For instance, the peptide addition for use in the present invention can comprise a peptide 
sequence selected from the group consisting of INAT/S, GNIT/S, VNIT/S, SNIT/S, ASNIT/S, 
NIT/S, SPINAT/S, ASPINAT/S, ANIT/SANIT/SANI, ANIT/SGSNIT/SGSNIT/S, 
ASNST/SNNGT/SLNAT/S, ANHT/SNET/SNAT/S, GSPINAT/S, ASPINAT/SSPINAT/S, 
ANNT/SNYT/SNWT/S, ATNIT/SLNYT/SANT/ST, AANST/SGNIT/SINGT/S, 
AVNWT/SSNDT/SSNST/S, GNAT/S, ANNT/SNYT/SNST/S, and 
ANNTNYTNWT, wherein T/S is either a T or an S residue, preferably a T residue. 

The peptide addition can comprise one or more of these peptide sequences, e.g. at least two 
of said sequences either directly linked together or separated by one or more amino acid residues, or 
can contain two or more copies of any of these peptide sequences. It will be understood that the 
above specific sequences are given for illustrative purposes and thus do not constitute an exclusive 
list of peptide sequences of use in the present invention. 
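
Because each "T/S" in the above notation stands for a choice between T and S, a single entry describes several concrete peptide sequences. The following Python sketch expands an entry into all of its variants; the function name is hypothetical and the sketch assumes that every literal "T/S" in the entry denotes such a choice.

```python
from itertools import product


def expand_ts(pattern):
    """Expand each "T/S" in `pattern` into T or S and return all concrete variants."""
    parts = pattern.split("T/S")
    variants = []
    for choice in product("TS", repeat=len(parts) - 1):
        built = parts[0]
        for residue, tail in zip(choice, parts[1:]):
            built += residue + tail
        variants.append(built)
    return variants


print(expand_ts("INAT/S"))
print(expand_ts("ANIT/SANIT/SANI"))
```

The first variant returned is always the all-threonine form, in line with the stated preference for a T residue.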

In a more specific embodiment the peptide addition X is selected from the group consisting 
of INAT/S, GNIT/S, VNIT/S, SNIT/S, ASNIT/S, NIT/S, SPINAT/S, ASPINAT/S, 
ANIT/SANIT/SANI, and ANIT/SGSNIT/SGSNIT/S, wherein T/S is either a T or an S residue, 
preferably a T residue. 

In one embodiment, the peptide addition X has an N residue in position -2 or -1, and the 
polypeptide P or Px has a T or an S residue in position +1 or +2, respectively, the residue numbering 
being made relative to the N-terminal amino acid residue of P or Px, whereby an N-glycosylation 
site is formed. For instance, the polypeptide has a T or S residue in position 2, preferably a T 
residue, and the peptide addition is AN or comprises AN as the C-terminal amino acid residues. 
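
A bridging site of this kind can be detected by scanning the fused sequence for N-X-S/T sequons that straddle the junction. The Python sketch below is illustrative; the function name and the short polypeptide fragments are hypothetical, and an N-terminal peptide addition (NH2-X-P-COOH) is assumed.

```python
import re


def bridging_sequons(addition, polypeptide):
    """Return 1-based N positions (in the fused sequence) of sequons spanning the junction."""
    fused = addition + polypeptide
    junction = len(addition)  # last 1-based position belonging to the peptide addition
    found = []
    for match in re.finditer(r"(?=N[^P][ST])", fused):
        n_pos = match.start() + 1
        # The sequon occupies n_pos..n_pos+2; it bridges if it starts in the
        # addition and ends in the polypeptide.
        if n_pos <= junction < n_pos + 2:
            found.append(n_pos)
    return found


# N in position -1 of the addition, T in position +2 of a hypothetical polypeptide.
print(bridging_sequons("AN", "ATKL"))
# N in position -2 of the addition, T in position +1 of a hypothetical polypeptide.
print(bridging_sequons("NA", "TKL"))
```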

Removal of glycosylation site 

In addition or as an alternative to introducing a glycosylation site it may be desirable to 
remove one or more glycosylation sites of the parent polypeptide, for instance if such glycosylation 
site is located at the catalytic site of a parent lysosomal enzyme and thus, when glycosylated, will 
lead to reduced or no enzymatic activity. Accordingly, the polypeptide of the invention may lack at 
least one glycosylation site present in the parent naturally-occurring enzyme or activator, typically a 
glycosylation site located in a functional site of the parent polypeptide such as a catalytic site of the 
lysosomal enzyme. The glycosylation site to be removed may be an in vivo or in vitro glycosylation 
site. When removing a glycosylation site this is preferably done by substitution, preferably by a 
conservative substitution. Conservative substitution tables providing functionally similar amino 
acids are well known in the art. The table below sets forth six groups which contain amino acids 
that are "conservative substitutions" for one another. 



1    Alanine (A)          Serine (S)           Threonine (T) 
2    Aspartic acid (D)    Glutamic acid (E) 
3    Asparagine (N)       Glutamine (Q) 
4    Arginine (R)         Lysine (K) 
5    Isoleucine (I)       Leucine (L)          Methionine (M)    Valine (V) 
6    Phenylalanine (F)    Tyrosine (Y)         Tryptophan (W) 
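
The six groups can be used programmatically to propose conservative substitutions that remove an N-glycosylation site. The Python sketch below is a minimal illustration; the function names and the toy sequence are hypothetical, and the exclusion of S and T as replacements for the third sequon residue reflects the fact that exchanging S for T (or vice versa) would leave the N-X-S/T site intact.

```python
# The six conservative substitution groups from the table above.
CONSERVATIVE_GROUPS = [
    set("AST"), set("DE"), set("NQ"), set("RK"), set("ILMV"), set("FYW"),
]


def conservative_alternatives(residue):
    """Return the other members of the residue's group, or [] if it is in no group."""
    for group in CONSERVATIVE_GROUPS:
        if residue in group:
            return sorted(group - {residue})
    return []


def removal_substitutions(sequence, n_position):
    """Conservative substitutions destroying the N-X-S/T sequon whose N is at n_position."""
    options = []
    n_residue = sequence[n_position - 1]
    third_position = n_position + 2
    third = sequence[third_position - 1]
    for new in conservative_alternatives(n_residue):
        options.append(f"{n_residue}{n_position}{new}")
    for new in conservative_alternatives(third):
        if new not in "ST":  # S<->T would keep the site functional
            options.append(f"{third}{third_position}{new}")
    return options


# Hypothetical fragment containing the sequon N-A-T with N in position 2.
print(removal_substitutions("GNATG", 2))
```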



Number of glycosylation sites 

Irrespective of how additional glycosylation sites are provided (whether in the mature part 
of the polypeptide or by means of a peptide addition), the polypeptide of the invention normally 
comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more introduced glycosylation sites, in 
particular N-glycosylation sites, the upper limit being determined by the number of introduced 
glycosylation sites that can be introduced without substantially reducing the in vivo activity of the 
resulting polypeptide. Preferably, the polypeptide comprises 2-10 introduced glycosylation sites, e.g. 
at least 2-3 introduced glycosylation sites, such as 4-5 introduced glycosylation sites, in particular 
N-glycosylation sites. Analogously, 0-15 glycosylation sites may have been removed from the 
parent polypeptide, typically 0-5. The total number of glycosylation sites present in the polypeptide 
of the invention is normally in the range of 1-20, such as 3-15. For instance, the polypeptide of the 
invention comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more glycosylation sites. 

Chimeric polypeptides 

In a further aspect the invention relates to a chimeric polypeptide comprising a lysosomal enzyme 
unit linked to one or more units of an activator of said enzyme. The term "unit" is intended to 
indicate a polypeptide having the activity of the enzyme or activator, respectively. For instance, a 
lysosomal enzyme unit comprises the amino acid sequence of the mature lysosomal enzyme, in the case 
of GCB, e.g. the amino acid sequence of SEQ ID NO 1, optionally modified by one or more amino 
acid changes. Likewise, an activator unit comprises, e.g., the amino acid sequence of a mature 
activator, in the case of SapC, e.g. the amino acid sequence of SEQ ID NO 3, optionally modified by 
one or more amino acid changes. 

The enzyme and/or activator constituents of the chimeric polypeptide may be any 
polypeptide exhibiting the relevant lysosomal enzyme or activator activity. For instance, the 
lysosomal enzyme constituent is a wt lysosomal enzyme or a variant or functional fragment thereof, 
or a modified lysosomal enzyme as described herein having introduced glycosylation site(s). 
Analogously, the activator may be a wt lysosomal enzyme activator or a variant or functional 
fragment thereof, or a modified activator as described herein having introduced glycosylation site(s). 

While the enzyme and activator units may be linked by any type of linkage, in particular a 
covalent linkage, such as by chemical cross-linking using cross-linking agents known in the art, or 
by di-sulphide bridges, it is particularly preferred that the polypeptide constituents are linked via a 
peptide bond or a peptide linker (and thus that the chimeric polypeptide is a fusion polypeptide). If 
used, the linker peptide must be of a type (length, amino acid composition, amino acid sequence, 
etc.) that is adequate to link the two (or more) polypeptide constituents in such a way that they 
assume a conformation relative to one another so that the resulting polypeptide has the relevant 
lysosomal enzyme activity. Furthermore, the linker peptide is typically designed to increase the 
stability of the polypeptide towards proteolytic degradation, e.g. by use of special amino acid 
sequences or residues. The peptide linker sequence may comprise one or more glycosylation sites. 
For instance, the linker can contain the sequence NAT providing an N-glycosylation site. 

The linker may, e.g., be 0-50 amino acid residues long. For instance, the linker peptide 
predominantly includes the amino acid residues Gly, Ser, Ala or Thr. A typical linker comprises 1- 
30 amino acid residues, such as a sequence of about 2-20 or 3-15 amino acid residues. The amino 
acid residues selected for inclusion in the linker peptide should exhibit properties that do not 
interfere significantly with the activity of the chimeric polypeptide. Thus, the linker peptide should 
on the whole not exhibit a charge which would be inconsistent with the lysosomal enzyme activity 
of the chimeric polypeptide, or interfere with internal folding, or form bonds or other interactions 
with amino acid residues in one or more of the polypeptide constituents which would seriously 
impede the binding of the chimeric polypeptide to the mannose receptor. 

Specific linkers for use in the present invention may be designed on the basis of known 
naturally occurring as well as artificial polypeptide linkers (see, e.g., Hallewell et al. (1989), J. Biol. 
Chem. 264, 5260-5268; Alfthan et al. (1995), Protein Eng. 8, 725-731; Robinson & Sauer (1996), 
Biochemistry 35, 109-116; Khandekar et al. (1997), J. Biol. Chem. 272, 32190-32197; Fares et al. 
(1998), Endocrinology 139, 2459-2464; Smallshaw et al. (1999), Protein Eng. 12, 623-630; US 
5,856,456). For instance, linkers used for creating single-chain antibodies, e.g. a 15mer consisting of 
three repeats of a Gly-Gly-Gly-Gly-Ser amino acid sequence ((Gly4Ser)3), are contemplated to be 
useful in the present invention. Furthermore, phage display technology as well as selective infective 
phage technology can be used to diversify and select appropriate linker sequences (Tang et al., J. 
Biol. Chem. 271, 15682-15686, 1996; Hennecke et al. (1998), Protein Eng. 11, 405-410). Also, 
Arc repressor phage display has been used to optimise the linker length and composition for 
increased stability of the single-chain protein (Robinson and Sauer (1998), Proc. Natl. Acad. Sci. 
USA 95, 5929-5934). 

Another way of obtaining a suitable linker is by optimizing a simple linker, e.g. 
(Gly4Ser)n, through random mutagenesis. 

It will be clear from the present specification that whatever the nature of the linker, it 
should be one which is not readily susceptible to cleavage by e.g. proteases or chemical agents, 
since cleavage of the chimeric polypeptide to result in its polypeptide constituents is not desired in 
the present context. 
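
The linker guidelines given above (0-50 residues, predominantly Gly, Ser, Ala or Thr, for instance a (Gly4Ser)3 15mer) can be illustrated with a short Python sketch. The function names are hypothetical, the unit sequences are placeholders rather than SEQ ID NO 1 or SEQ ID NO 3, and reading "predominantly" as "more than half of the residues" is an assumption made only for this example.

```python
def gly4ser_linker(repeats=3):
    """Build a (Gly4Ser)n linker; repeats=3 gives the 15mer mentioned in the text."""
    return "GGGGS" * repeats


def linker_is_acceptable(linker, max_length=50, min_fraction=0.5):
    """Check length and composition against the guidelines described above."""
    if len(linker) > max_length:
        return False
    if not linker:
        return True  # zero-length linker, i.e. a direct peptide bond
    preferred = sum(1 for residue in linker if residue in "GSAT")
    # Assumption: "predominantly" is read as more than `min_fraction` of residues.
    return preferred / len(linker) > min_fraction


def fuse(units, linker):
    """Join polypeptide units N- to C-terminally with the same linker between each pair."""
    return linker.join(units)


linker = gly4ser_linker()
print(len(linker), linker_is_acceptable(linker))
# Placeholder unit sequences standing in for an enzyme unit and an activator unit.
print(fuse(["MKAAA", "CCCDE"], linker))
```

Such a check does not address protease susceptibility or folding, which would have to be assessed experimentally.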

In a further aspect the invention relates to a chimeric polypeptide comprising a lysosomal 
enzyme unit linked to one or more second polypeptide units, the second polypeptide being capable of 
targeting phagocytic cells, preferably macrophages or macrophage-like cells. The term "polypeptide 
targeting" is intended to indicate a polypeptide that is recognized and taken up by receptors present 
on phagocytic cells. Preferably, the lysosomal enzyme unit and the second polypeptide unit(s) are 
linked by a peptide bond or a peptide linker. 

Examples of targeting polypeptides include the Fc region of immunoglobulins. Three 
classes of receptors for the Fc region of IgG have been identified in mice and humans (for a review 
see Fridman et al., Immunological Reviews 125, 49-76, 1992). The Fc receptor FcγRI binds 
monomeric IgG with high affinity, and this receptor is found on monocytes, neutrophils and 
macrophages. The FcγR receptors mediate a large spectrum of functions. In macrophages they 
enable, among other things, phagocytosis of IgG-coated particles and endocytosis of immune complexes to lysosomes 
(Ukkonen et al., J. Exp. Med. 163, 952-971, 1986). A chimeric polypeptide comprising a 
lysosomal enzyme and the Fc part of IgG may therefore result in specific targeting of the chimeric 
polypeptide to macrophages by FcγR mediated endocytosis and may therefore be used in treatment 
of the relevant lysosomal storage disease, such as Gaucher's disease. Examples of chimeric 
polypeptides comprising Fc and a second polypeptide are described by Liu et al., Biochem. Biophys. 
Res. Comm. 197, 1094-1102, 1993, Dwyer et al., J. Biol. Chem. 274, 9738-9743, 1999 or Wang et 
al., Protein Engineering, 7, 715-722, 1994. Instead of a chimeric polypeptide, either a monoclonal or 
polyclonal antibody against the lysosomal enzyme may be coadministered with the enzyme and 
result in Fc mediated uptake into macrophages. 

Similarly, other receptors that are relatively specific for macrophages may be used for uptake 
of the lysosomal enzyme, such as GCB, by fusing the enzyme with the ligand for the receptor. 
Examples of such ligands are chemokines targeting a chemokine receptor specific for macrophages 
or lipoprotein targeting the scavenger receptor. 

The chimeric polypeptide comprising the lysosomal enzyme and the second polypeptide 
may further comprise one or more units of an activator for the lysosomal enzyme in question. 

The chimeric polypeptide of the invention may comprise more than one unit of the activator 
for the lysosomal enzyme and may comprise more than one type of activator. Typically, the 
chimeric polypeptide comprises 1-5 units of the activator. The order of activator and lysosomal 
enzyme is not believed to be critical and thus the activator may be added N- and/or C-terminally to 
the lysosomal enzyme, or within a non-structural part thereof. 

Specific chimeric polypeptides of the invention 

In a specific embodiment the lysosomal enzyme unit of a chimeric polypeptide of 
the invention is a GCB polypeptide. Thus, for instance, the chimeric polypeptide comprises a GCB 
polypeptide and at least one unit of a targeting polypeptide and/or at least one unit of a GCB 
activator (i.e. a polypeptide that is capable of increasing the in vivo activity of the GCB 
polypeptide). For instance, the targeting polypeptide is Fc and/or the GCB activator is SapA or 
SapC, preferably SapC. The chimeric polypeptide can comprise, e.g., 1-5 GCB activator units, of 
which at least one is preferably SapC. For instance, the chimeric polypeptide comprises 1, 2, 3, or 4 
units of SapC and 0, 1 or 2 units of SapA. 

The activator may be located N-terminally or C-terminally to the GCB polypeptide. Specific 
examples of a chimeric polypeptide according to this embodiment are chimeric polypeptides 
comprising the following structure: 
GCB-SapA-SapC, SapA-GCB-SapC, SapC-GCB-SapA, SapC-GCB-SapC, wherein, preferably, the 
units are linked by a peptide bond or peptide linker as described elsewhere herein. 

It will be understood that the chimeric polypeptides described in this section exhibit GCB 
activity and, when relevant, further have the activity of SapC. 

The GCB polypeptide unit may be a wtGCB or a functional fragment or variant thereof as 
described herein. In particular, the GCB polypeptide may be a GCB polypeptide of the invention as 
described herein. For instance, a fragment of wildtype or mutant GCB can be used, which lacks at 
least one, e.g. 1-20, such as 1-10 amino acid residues at the C-terminus (when the GCB is positioned 
at the N-terminal part of the chimeric polypeptide and/or is linked to an activator in its C-terminal 
end) or N-terminus (when the GCB is positioned at the C-terminal part of the chimeric polypeptide 
or linked to an activator in its N-terminal end). 

Other examples of chimeric polypeptides of the invention include a chimeric polypeptide 
comprising an Arylsulphatase A unit and at least one unit of Fc or SapB, e.g. 1-5 copies added at the 
N- and/or C-terminal of the lysosomal enzyme, and a chimeric polypeptide comprising an 
alpha-galactosidase unit and at least one unit of Fc or SapB and/or SapD, e.g. 1-5 copies added at the 
N- and/or C-terminal of the alpha-galactosidase unit. 


The parent polypeptide 
The parent polypeptide to be modified in accordance with the general principle outlined above may 
be any lysosomal enzyme or lysosomal enzyme activator. Preferably, the lysosomal enzyme or 
activator is one that binds to a mannose receptor or a mannose-6-phosphate receptor. Examples of 
such lysosomal enzymes include glucocerebrosidase (GCB), α-L-iduronidase, acid 
α-glucosidase, α-galactosidase, acid sphingomyelinase, galactocerebrosidase, arylsulphatase A, 
sialidase, and hexosaminidase. Examples of activators include SapA, SapB, SapC, SapD, and GM-2 
activator (the latter activates hexosaminidase). These enzymes and activators are well-known in the 
art and the skilled person will be aware of how to clone the genes encoding these enzymes for use in 
modification according to the present invention. 


A GCB polypeptide of the invention 

In a preferred embodiment the lysosomal enzyme to be modified is a GCB polypeptide, and thus the 
polypeptide of the invention is a GCB polypeptide. 

The present application is believed to be the first disclosure of a modified GCB polypeptide 
that has an amino acid sequence that differs from that of a wtGCB polypeptide by at least one amino 
acid residue, and has an increased in vivo activity relative to said wtGCB. 

In particular, the present application is believed to constitute the first disclosure of a GCB 
polypeptide comprising an amino acid sequence that differs from that of a parent GCB polypeptide 
in that at least one amino acid residue comprising an attachment group for a macromolecular moiety 
has been introduced or at least one amino acid residue comprising an attachment group for a 
macromolecular moiety has been removed, in order to render the polypeptide more susceptible to 
conjugation to such macromolecular moiety. The term "differs" as used in the present application is 
intended to allow for additional differences being present. Such GCB polypeptide is of particular 
interest for preparing a conjugated polypeptide, further comprising at least one covalently attached 
macromolecular moiety of a type capable of attaching to the introduced or removed amino acid 
residue. 

Of particular interest is a GCB polypeptide comprising the modifications described above in 
the section entitled "Introduction of glycosylation site(s)". Accordingly, in one embodiment the 
GCB polypeptide is a glycosylated GCB polypeptide, which comprises at least one introduced 
glycosylation site as compared to a parent GCB polypeptide (whether it be in the mature part of the 
GCB polypeptide or as a peptide addition thereto). 

In one embodiment, the parent GCB polypeptide to be modified according to the invention 
comprises or is constituted by an amino acid sequence that corresponds to that of a wtGCB, in 
particular the sequence shown in SEQ ID NO 1 in which the amino acid residue located in position 
495 is either H or R, or a variant or functional fragment thereof. Thus, the GCB polypeptide of the 
invention may comprise or be a sequence of amino acids corresponding to the sequence of a wtGCB 
except for the modification(s) introduced into the sequence in accordance with the invention. 

For convenience, the wtGCB having the amino acid sequence shown in SEQ ID NO 1 is 
used as the backbone for the modifications disclosed in the present section. However, it will be 
understood that other GCBs may constitute parent GCB polypeptides to be modified in accordance 
with the invention. Such parent polypeptides are conveniently modified in positions, which are 
equivalent to those identified in SEQ ID NO 1. An "equivalent position" is intended to indicate a 
position in the amino acid sequence of a given GCB, which is homologous (i.e. corresponding in 
position in either primary or tertiary structure) to a position in the amino acid sequence shown in 
SEQ ID NO 1. The "equivalent position" is conveniently determined on the basis of an alignment of 
members of the GCB sequence family, e.g. using the program CLUSTALW version 1.74 using 
default parameters (Thompson et al., 1994, CLUSTAL W: improving the sensitivity of progressive
multiple sequence alignment through sequence weighting, position-specific gap penalties and weight 
matrix choice, Nucleic Acids Research, 22:4673-4680) or from published alignments. For instance, 
O'Neill et al., PNAS 86, 5049-5053, 1989 discloses an alignment of human and murine GCB genes. 
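
The determination of an "equivalent position" from an alignment can be illustrated by the following minimal sketch. It assumes that two already aligned sequences are available as strings with "-" marking gaps (for example rows taken from a CLUSTAL W output); the function name and the toy fragments are hypothetical and are not actual GCB sequences.

```python
def equivalent_positions(aligned_ref: str, aligned_other: str) -> dict[int, int]:
    """Map 1-based residue positions of the reference onto the other sequence.

    Both inputs are rows of the same alignment, with '-' marking a gap.
    Reference residues aligned against a gap have no equivalent position.
    """
    if len(aligned_ref) != len(aligned_other):
        raise ValueError("aligned sequences must have equal length")
    mapping: dict[int, int] = {}
    ref_pos = 0
    other_pos = 0
    for ref_char, other_char in zip(aligned_ref, aligned_other):
        if ref_char != "-":
            ref_pos += 1
        if other_char != "-":
            other_pos += 1
        if ref_char != "-" and other_char != "-":
            mapping[ref_pos] = other_pos
    return mapping


# Hypothetical aligned fragments, for illustration only.
ref_row = "ARPCI-PKSF"
other_row = "A-PCIGPRSF"
mapping = equivalent_positions(ref_row, other_row)
```

In this toy case position 7 of the reference maps to position 7 of the other sequence, whereas position 2 of the reference faces a gap and therefore has no equivalent position.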

When the attachment group to be introduced is a glycosylation site, the modified GCB of the
invention can be produced with increased glycosylation as compared to that achievable through
the four native N-glycosylation sites of wtGCB.

For instance, in order to introduce an N-glycosylation site into a parent GCB polypeptide of
the invention the polypeptide comprises one or more substitutions, relative to the amino acid
sequence shown in SEQ ID NO:1 or an equivalent position of another backbone, selected from the
group consisting of K7N+F9T, K7N+*9T, K7N+*9S (*9T and *9S represent an insertion of a
threonine and serine residue, respectively, between amino acid residues S8 and F9), K7N+F9S,
K74N+Q76T, K74N+Q76S, K77N+K79T, K77N+K79S, K79N+F81T, K79N+F81S,
K106N+Y108T, K106N+Y108S, K155N+K157T, K155N+K157S, K157N+P159T, K157N+P159S,
K186N+N188T, K186N+N188S, K193N+S195T, K194N, K194T, K198N+Q200T, 
K198N+Q200S, K215N+L217T, K215N+L217S, E222N+K224T, K224N+Q226T, K224N+Q226S, 
K293N+L295T, K293N+L295S, K303N+V305T, K303N+V305S, K321N, K321N+T323S, 
K346N+W348T, K346N+W348S, K408N, K408N+T410S, K413N+P415T, K413N+P415S, 
K425N+I427T, K425N+I427S, K441N+D443T, K441N+D443S, K466N+V468T, K466N+V468S, 
K473N+P475T and K473N+P475S. 
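
The substitution notation used above can be illustrated by the following minimal sketch, which applies a set such as K7N+F9T to a sequence and reports where N-glycosylation motifs occur. Two assumptions are made: the motif is taken to be the commonly cited N-X-S/T pattern in which X is not proline, and the toy sequence is hypothetical (only residues 6-11 follow the residue letters named in the text, P6, K7, S8, F9, G10 and Y11; the remainder is placeholder). Insertions such as *9T are deliberately not handled.

```python
import re

# Assumed N-glycosylation motif: N-X-S/T with X not proline.
# The lookahead allows overlapping motifs to be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")
TOKEN = re.compile(r"([A-Z])(\d+)([A-Z])")


def apply_substitutions(seq: str, spec: str) -> str:
    """Apply substitutions written as e.g. 'K7N+F9T' (1-based positions)."""
    residues = list(seq)
    for token in spec.split("+"):
        match = TOKEN.fullmatch(token)
        if match is None:
            raise ValueError(f"unsupported mutation token: {token}")
        old, pos, new = match.group(1), int(match.group(2)), match.group(3)
        if residues[pos - 1] != old:
            raise ValueError(f"expected {old} at position {pos}")
        residues[pos - 1] = new
    return "".join(residues)


def sequon_positions(seq: str) -> list[int]:
    """Return the 1-based positions of the N residue of each motif."""
    return [m.start() + 1 for m in SEQUON.finditer(seq)]


# Hypothetical toy parent; positions 1-5 and 12 are placeholders.
toy_parent = "AAAAAPKSFGYA"
toy_variant = apply_substitutions(toy_parent, "K7N+F9T")
sites_before = sequon_positions(toy_parent)
sites_after = sequon_positions(toy_variant)
```

With this toy input the parent contains no motif, while the K7N+F9T variant contains one motif whose asparagine is located in position 7.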

Additionally or alternatively, the polypeptide may comprise a substitution to an 
asparagine residue in one or more of the positions selected from the group consisting of P6, G10, 
Y11, C23, T36, Y40, T43, E50, A95, L105, Y108, M133, D137, P171, L175, W179, K194, H206,
L240, A269, E235, F337, V343, E349, L354, Q362, S364, V398, H422, E429, V437, D453, R463,
T482, G486, P28, L34, E41, T61, L66, A84, I130, T132, A136, S181, E152, P178, L185, H206,
G255, A291, G250, V295, K321, G325, P332, B67, G377, D405, K408, P465, L480 and I489 of
the amino acid sequence shown in SEQ ID NO:1.

A preferred polypeptide of the invention comprises at least one of the following sets of
mutations or any other specific mutations listed in Table 3 in Example 6 below:
K194N;
K224N+Q226T;
E41N;
E222N+K224T;
K303N+V305T;
E41N+K194N+K224N+Q226T;
K194N+E222N+K224T+K303N+V305T;
E41N+K194N+K224N+Q226T+K303N+V305T;
D153N+K155T;
R163N+L165T;
T132N; and/or
I130N.

Of the above-mentioned specific mutants, those are preferred which are located outside the last 50
C-terminal amino acid residues of the parent GCB polypeptide and/or which require only one
substitution to introduce an in vivo glycosylation site.

For instance, in order to introduce an in vitro glycosylation site into a parent GCB 
polypeptide, an amino acid residue constituting an in vitro glycosylation site, preferably a lysine 
residue, is introduced into one or more positions, relative to the amino acid sequence shown in SEQ 
ID NO: 1 or an equivalent position of another GCB backbone, selected from the group consisting of
R2, R39, R44, R47, R48, R120, R131, R163, R170, R211, R257, R262, R277, R285, R339, R353,
R359, R395, R433, R463, R495, R496, H60, H145, H162, H206, H223, H255, H273, H274, H290,
H306, H311, H328, H365, H374, H419, H422, H451, H490, D24, D27, D87, D127, D137, D140,
D141, D153, D203, D218, D258, D263, D282, D283, D298, D358, D380, D399, D405, D409,
D443, D445, D453, D467, D474, E41, E50, E72, E111, E112, E151, E152, E222, E233, E235,
E254, E300, E326, E340, E349, E388, E429, and E481. In vitro glycosylation sites other than lysine
may be introduced in the same positions.

The GCB polypeptide of the invention having at least one introduced in vitro glycosylation 
site may have been further modified in that an in vitro glycosylation site present in the parent GCB 
polypeptide has been removed, e.g. to reduce the number of glycosylation sites to avoid too
extensive glycosylation. For instance 1-5 such sites may be removed. The in vitro glycosylation site
to be removed is e.g. located at a functional site. In the present context the term "functional site" is
intended to indicate one or more amino acid residues which is/are essential for or otherwise involved
in the function or performance of GCB. Such amino acid residues are "located at" the functional
site. The functional site may be determined by methods known in the art. Amino acid residues E340
and E235 of SEQ ID NO 2 have been found to be part of a functional site of wt human GCB, and
any amino acid residue of the parts of SEQ ID NO 2 defined by amino acid residues 336-344 and 
231-239 are contemplated to be located at a functional site. 

For instance, when the in vitro glycosylation site is a lysine residue, a lysine residue present
in the parent GCB can be substituted with another amino acid residue, preferably arginine, or
deleted. For instance, at least one of the lysine residues located in a position selected from the group
consisting of K7, K74, K77, K79, K106, K155, K157, K186, K193, K197, K215, K224, K293,
K303, K321, K346, K408, K413, K425, K441, K466 and K473 of the amino acid sequence shown
in SEQ ID NO: 1 has been replaced with another amino acid residue, in particular an arginine residue, or
deleted.

In yet another embodiment the GCB polypeptide of the invention has been modified so as to 
obtain reduced susceptibility to proteolytic degradation. It is presently contemplated that a 
proteolytic cleavage site is located around amino acid residue 136 of wtGCB. Accordingly, in one 
embodiment the GCB polypeptide of the invention comprises a modification at any of amino acid
residues 132-139 relative to SEQ ID NO 1, resulting in reduced susceptibility to proteolytic
degradation. One convenient way of achieving shielding of a proteolytic site is by use of a
macromolecular moiety, in particular a polymer or an oligosaccharide moiety. For this purpose, the
GCB polypeptide according to this embodiment may be modified so as to have introduced an
attachment group for said moiety (e.g. a glycosylation site) into an equivalent position of the parent
GCB polypeptide relative to amino acid residues 132-139 of SEQ ID NO 1. For instance, an
N-glycosylation site is introduced so that the N-residue of said site occupies any of positions 132-139.
Alternatively, a proline is introduced into any such position. Specific mutations believed to provide 
reduced proteolytic cleavage include: A136N, A135P or A136P. 



A modified SapC polypeptide of the invention

In another embodiment the lysosomal enzyme activator to be modified in accordance with the 
invention is SapC. In particular, the parent SapC polypeptide has the sequence shown in SEQ ID 
NO 3. While the parent SapC may be modified to introduce any attachment group for a 
macromolecular moiety, it is presently preferred that it be modified by introduction of a
glycosylation site, in particular an in vivo glycosylation site such as an N-glycosylation site. In
this case the SapC polypeptide of the invention may comprise at least one mutation selected from
the group consisting of S1N+V3T/S, D2N+Y4T/S, K13N+V15T/S, E14N, K17N+I19T/S,
I19N+N21T/S, E25N+E27T/S, K26N+I28T/S, D30N+F32T/S, D33N+M35T/S, K38N+P40T/S,
S42N, S44N+E46T/S, and V51N (relative to SEQ ID NO 3), wherein T/S indicates a threonine or a
serine residue, preferably a threonine residue.

SapC has been expressed recombinantly in E. coli (Qi et al., J. Biol. Chem. 269, 16746-16753,
1994), but apparently not in glycosylating host cells. Accordingly, in a further aspect the
invention relates to a recombinant glycosylated SapC polypeptide. The glycosylated SapC
polypeptide may be wtSapC or a variant or functional fragment thereof or a modified SapC
polypeptide as described in the present application.

Preferably, the SapC polypeptide of the invention has at least one of the following 
properties: 

It enhances the in vivo activity of endogenous glucocerebrosidase,
It enhances the in vivo activity of glucocerebrosidase in a patient to which glucocerebrosidase has
been administered,
It exhibits an increased uptake in phagocytic cells, preferably macrophages or macrophage-like cells,
It exhibits increased activity or functional in vivo half-life in lysosomes or under conditions
mimicking lysosomal conditions, and/or
It increases an in vitro bioactivity of glucocerebrosidase.

The Methods section comprises suitable assays for determining such activities.

The SapC polypeptide according to the invention finds particular use in therapy, alone or in 
combination with GCB (wtGCB or a commercially available GCB or a GCB polypeptide of the 
present invention), or as a constituent of a chimeric polypeptide of the invention. 

Glycosylation

In most cases, the polypeptide of the invention is glycosylated (i.e. comprises an in vivo attached N-
or O-linked oligosaccharide moiety or in vitro attached oligosaccharide moiety) and furthermore has
an altered glycosylation profile as compared to that of the parent polypeptide. For instance, the
altered glycosylation profile is a consequence of an altered, normally increased, number of attached
oligosaccharide moieties and/or an altered type of attached oligosaccharide moieties.

The type of oligosaccharide moiety should normally be one that exhibits sufficient affinity 
for or uptake by a mannose receptor, thereby enabling the glycosylated polypeptide of the invention 
to exhibit improved affinity for or uptake by such receptor. 

In the present context the term "mannose receptor" is intended to indicate any mannose receptor of 
interest in the present invention, including, in particular, a macrophage mannose receptor (of
relevance for GCB) and a mannose-6-phosphate receptor (of relevance for some of the other
lysosomal enzymes). Such improved affinity for or uptake by the mannose receptor is expected to
result in increased uptake in phagocytic cells, preferably monocytes, macrophages (e.g. Kupffer
cells, glia/microglia, alveolar phagocytes, reticulum cells, or other peripheral macrophages) or
macrophage-like cells (for instance osteoclasts, dendritic cells, or astrocytes). Also, increased
lysosomal activity of the polypeptide is expected. Consequently, increased in vivo activity of the
polypeptide and thereby increased therapeutic utility may result. 

Furthermore, the type of oligosaccharide moiety to be attached should normally be one that 
does not lead to increased immunogenicity of the modified polypeptide as compared to that of the 
parent polypeptide, but rather equal or reduced immunogenicity as compared to the parent, in
particular when the glycosylated lysosomal enzyme or activator is to be used in therapy. 

The oligosaccharide moiety is preferably one provided by in vivo glycosylation. In order to 
achieve in vivo glycosylation of a polypeptide which has been modified by introduction of one or 
more glycosylation sites as described above, a nucleotide sequence encoding the polypeptide should
be inserted in a glycosylating, eukaryotic expression host. The expression host cell may be selected
from fungal (filamentous fungal or yeast), insect or animal cells or from transgenic plant cells. Also,
the glycosylation may be achieved in the human body when using a nucleotide sequence encoding
the polypeptide of the invention in gene therapy. Insect cell mediated in vivo N-glycosylation has
proven to be of particular relevance for the present invention. Expression of the polypeptide in any
of the above host cells may also result in the polypeptide being O-glycosylated at one or more serine
or threonine residues.

It will be apparent from the description above that to obtain an improved uptake by the 
mannose receptor, at least one oligosaccharide chain of the glycosylated polypeptide of the 
invention comprises at least one exposed mannose residue. The term "mannose residue" is used
generally about any functional mannose-based derivative, such as a mannosyl residue and a
mannosyl phosphate group, capable of binding to a mannose receptor. The term "exposed" is
intended to indicate that the oligosaccharide chain terminates with a mannose residue or that the
mannose residue is located in such a position in the 3-D structure of the polypeptide, that it is readily
available to bind with a mannose receptor protein. More preferably, when the polypeptide
comprises more than one oligosaccharide chain, at least 50% of such chains, in particular at least
75% or all of such chains comprise at least 1 exposed mannose residue, in particular at least 2
exposed mannose residues, more preferably at least 3 exposed mannose residues, e.g. 1-5 exposed
mannose residues. For instance, at least one, such as two, three or all of the oligosaccharide chains
comprise 2, 3, 4, 5 or 6 exposed mannose residues.
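
The percentage criteria mentioned above can be checked as in the following minimal sketch. It assumes that the number of exposed mannose residues has already been determined for each oligosaccharide chain by one of the analytical methods referred to herein; the function name and the example counts are hypothetical.

```python
def fraction_with_exposed_mannose(exposed_per_chain: list[int],
                                  min_exposed: int = 1) -> float:
    """Fraction of chains carrying at least `min_exposed` exposed mannose residues."""
    if not exposed_per_chain:
        raise ValueError("at least one oligosaccharide chain is required")
    hits = sum(1 for count in exposed_per_chain if count >= min_exposed)
    return hits / len(exposed_per_chain)


# Hypothetical polypeptide with four chains; one chain has no exposed mannose.
chains = [3, 0, 2, 1]
fraction_one_or_more = fraction_with_exposed_mannose(chains)
fraction_two_or_more = fraction_with_exposed_mannose(chains, min_exposed=2)
```

In this example three of the four chains carry at least one exposed mannose residue, so the 50% and 75% levels are both met, while only half of the chains carry two or more.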

In addition to exposed mannose residues the oligosaccharide chain(s) of the glycosylated
polypeptide of the invention may comprise additional, non-exposed mannose residues. For instance,
at least one of the oligosaccharide chains comprises 1-20 non-exposed mannose residues, such as
2-10 non-exposed mannose residues.

Examples of preferred oligosaccharide structures with exposed mannose residues are shown
in Fig. 1 of US 5,236,838, the contents of which are incorporated herein by reference, as well as in 
the Examples section herein. 

Expressed differently, the glycosylated polypeptide of the invention comprises at least one 
N-linked oligosaccharide chain being of the high mannose type (as defined in US 5,218,092 or in
Fig. 2 of Gemmill et al., Biochimica et Biophysica Acta 1426 (1999) 227-237, the contents of which
are incorporated herein by reference). Expression in insect cells and in yeast cells has been found to
provide glycosylated polypeptides with such oligosaccharides (see the examples herein).
Furthermore, the polypeptide may comprise at least one O-linked oligosaccharide, e.g. having any of
the structures disclosed in Fig. 3 of Gemmill et al., Biochimica et Biophysica Acta 1426 (1999)
227-237.

In one embodiment, in addition to mannose residues the glycosylated polypeptide of the 
invention may comprise at least one fucose residue. In another embodiment the glycosylated 
polypeptide is free of fucose, since, sometimes, fucose gives rise to immunogenicity. A fucose 
residue may be removed by subjecting the glycosylated polypeptide comprising such residue to
treatment with a fucosidase and recovering the resulting fucose free glycosylated polypeptide. 

In particular, a polypeptide of the invention comprises at least one oligosaccharide moiety 
with the following structure: 

Asn-N-N-M-M2
F

wherein Asn indicates the Asn residue of the polypeptide to which the oligosaccharide chain
is attached, N an N-acetylglucosamine residue, F a fucose residue which may or may not be present
and M-M2 three mannose residues, two of which are linked to the same third mannose residue.

Other preferred oligosaccharide structures are any of the oligosaccharides described in the Examples
section hereinafter, or any of the structures shown in Fig. 8. Such structures may be provided by N- 
glycosylation or by in vitro glycosylation. 

The nature and number of oligosaccharide moieties of a glycosylated polypeptide of the 
invention may be determined by a number of different methods known in the art, e.g. by lectin
binding studies (Reddy et al., 1985, Biochem. Med. 33: 200-210; Cummings, 1994, Meth. Enzymol.
230: 66-86; Protein Protocols (Walker ed.), 1998, chapter 9); by reagent array analysis method
(RAAM) sequencing of released oligosaccharides (Edge et al., 1992, Proc. Natl. Acad. Sci. USA 89:
6338-6342; Prime et al., 1996, J. Chrom. A 720: 263-274); by RAAM sequencing of released
oligosaccharides in combination with mass spectrometry (Klausen et al., 1998, Molecular
Biotechnology 9: 195-204); or by combining proteolytic degradation, glycopeptide purification by
HPLC, exoglycosidase degradations and mass spectrometry (Krogh et al., 1997, Eur. J. Biochem.
244: 334-342). Specific methods for determining the glycosylation profile are described in the
examples section hereinafter. 

When the polypeptide is expressed in glycosylating host cells, which do not naturally 
provide exposed mannose residues (e.g. a mammalian cell), the glycosylated polypeptide of the 
invention is preferably subjected to enzymatic treatment subsequent to its expression to remove non- 
mannose sugar residues. The enzymatic treatment may, e.g., be as described in US 5,549,892, the 
contents of which are incorporated herein by reference. 

A polypeptide of the invention comprising the above defined exposed and/or non-exposed mannose 
residues may be obtained by in vitro glycosylation, e.g. utilizing available attachment groups on the 
wild-type or modified polypeptide. Chemically synthesized oligosaccharide structures can be 
attached to the polypeptide using a variety of different chemistries e.g. the chemistries employed for 
attachment of PEG to proteins, wherein the oligosaccharide is linked to a functional group, 
optionally via a short spacer (see the section entitled Conjugation to a Non-Oligosaccharide 
Macromolecular Moiety). The in vitro glycosylation can be carried out in a suitable buffer at pH 4-7 
in protein concentrations of 0.5-2 mg/ml and a volume of 0.02-2 ml. The activated mannose 
compound is present in 2-200 fold molar excess, and reactions are incubated at 4-25°C for periods of 
0.1-3 hours. In vitro glycosylated GCB polypeptides are purified by dialysis and standard 
chromatographic techniques. 
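
The quantities involved in the in vitro glycosylation described above can be estimated as in the following minimal sketch. The ranges are those quoted in the preceding paragraph; the molecular weight of 60,000 Da is merely an assumed illustrative value and not a value stated herein, and all names are hypothetical.

```python
from dataclasses import dataclass

# Ranges quoted in the text for the in vitro glycosylation reaction.
RANGES = {
    "ph": (4.0, 7.0),
    "protein_mg_per_ml": (0.5, 2.0),
    "volume_ml": (0.02, 2.0),
    "molar_excess": (2.0, 200.0),
    "temperature_c": (4.0, 25.0),
    "hours": (0.1, 3.0),
}


@dataclass(frozen=True)
class Reaction:
    ph: float
    protein_mg_per_ml: float
    volume_ml: float
    molar_excess: float
    temperature_c: float
    hours: float

    def out_of_range(self) -> list[str]:
        """Names of the parameters that fall outside the quoted ranges."""
        return [name for name, (low, high) in RANGES.items()
                if not low <= getattr(self, name) <= high]


def protein_nmol(reaction: Reaction, protein_mw_da: float) -> float:
    """Amount of protein in nmol (mg divided by g/mol gives mmol; 1 mmol = 1e6 nmol)."""
    milligrams = reaction.protein_mg_per_ml * reaction.volume_ml
    return milligrams / protein_mw_da * 1e6


def reagent_nmol(reaction: Reaction, protein_mw_da: float) -> float:
    """Amount of activated mannose compound needed for the chosen molar excess."""
    return protein_nmol(reaction, protein_mw_da) * reaction.molar_excess


ASSUMED_MW_DA = 60_000.0  # illustrative assumption, not a value from the text
reaction = Reaction(ph=5.5, protein_mg_per_ml=1.0, volume_ml=1.0,
                    molar_excess=50.0, temperature_c=20.0, hours=1.0)
```

Under these assumptions 1 mg of protein corresponds to about 16.7 nmol, so a 50-fold molar excess calls for about 833 nmol of the activated mannose compound.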

Other in vitro glycosylation methods are described, for example, in WO 87/05330 and by Aplin
et al., CRC Crit. Rev. Biochem., pp. 259-306, 1981. Furthermore, Doebber et al., J. Biol. Chem.,
257, pp. 2193-2199, 1982, the contents of which are incorporated herein by reference, describe a
convenient method for attaching a synthetic Man3Lys2 glycopeptide to lysine residues by in vitro
glycosylation. However, coupling to a lysine residue may result in increased immunogenicity of the
resulting polypeptide, and may not always be desirable for the present purpose.

Furthermore, in vitro glycosylation to protein- and peptide-bound Gln-residues can be
carried out by transglutaminases (TGases). Transglutaminases catalyse the transfer of donor amine
groups to protein- and peptide-bound Gln-residues in a so-called cross-linking reaction. The donor
amine groups can be protein- or peptide-bound, e.g. as the ε-amino group in Lys-residues, or can be
part of a small or large organic molecule. An example of a small organic molecule functioning as 
amino-donor in TGase-catalysed cross-linking is putrescine (1,4-diaminobutane). An example of a 
larger organic molecule functioning as amino-donor in TGase-catalysed cross-linking is an amine- 
containing PEG (Sato et al., Biochemistry 35, 1996, 13072-13080). 

TGases, in general, are highly specific enzymes, and not every Gln-residue exposed on the
surface of a protein is accessible to TGase-catalysed cross-linking to amino-containing substances.
In order to render a protein susceptible to TGase-catalysed cross-linking reactions, stretches of amino
acid sequence known to function very well as TGase substrates are inserted at convenient positions
in the amino acid sequence of a GCB polypeptide. Several amino acid sequences are known
to be or to contain excellent natural TGase substrates, e.g. substance P, elafin, fibrinogen,
fibronectin, α2-plasmin inhibitor, α-caseins, and β-caseins, and may thus be inserted into and thereby
constitute part of the amino acid sequence of a polypeptide of the invention. 

Normally, the glycosylated polypeptide of the invention comprises 1-15 oligosaccharide 
moieties, such as 1-10 or 1-6 oligosaccharide moieties.

The glycosylated polypeptide of the invention may further comprise at least one non- 
oligosaccharide macromolecular moiety, such as a polymer molecule, e.g. PEG, attached to an 
attachment group present in the parent polypeptide or having been introduced (as described in the 
section entitled "Conjugation to a non-oligosaccharide macromolecular moiety"). 

Conjugation to a non-oligosaccharide macromolecular moiety 

In the present application the focus has been on modifying lysosomal enzymes and lysosomal
enzyme activators by introduction of additional glycosylation sites. However, the invention is not
limited to modification of glycosylation sites only. Also included in the invention is modification of
amino acid residues constituting an attachment group for any other suitable (non-oligosaccharide)
macromolecular moiety, in particular a polymer moiety such as PEG. It will be understood that the
same principles for introducing/removing attachment groups for PEG etc. apply as have been
described above for introduction/removal of glycosylation sites, in particular in connection with
introducing/removing in vitro glycosylation sites, since such sites may also function as attachment
groups for non-oligosaccharide macromolecular moieties such as PEG.

Accordingly, in one aspect the polypeptide of the invention is a lysosomal enzyme or 
lysosomal enzyme activator that comprises an amino acid sequence that differs from that of a parent 
enzyme or activator by at least one introduced and/or at least one removed amino acid residue 
comprising an attachment group for a non-oligosaccharide macromolecular moiety, the introduction 
and/or removal of the attachment group being done analogously to that described in the sections 
"Introduction of a glycosylation site" and "Removal of a glycosylation site". Thus, for instance, the 
attachment group may be introduced into the mature part of the polypeptide or by means of a 
peptide addition on the basis of the same principles as those described above for introduction of a 
glycosylation site. The polypeptide according to this aspect is preferably a conjugated polypeptide 
comprising at least one non-oligosaccharide macromolecular moiety attached to the relevant 
attachment group. The conjugated polypeptide may further comprise at least one oligosaccharide 
moiety (e.g. as a consequence of in vivo or in vitro glycosylation). The polypeptide according to this
embodiment may be any of the glycosylated polypeptides described herein, or may be one that does
not contain an additional glycosylation site (relative to the parent polypeptide).

The type of macromolecular moiety is selected on the basis of the effect it is desired to 
provide. For instance, for shielding of epitopes and increasing serum half-life, a polymer such as 
PEG has been found useful. For increasing targeting to lysosomes the macromolecular moiety is
preferably a phospholipid, a lipid or a mannose-containing compound.

The attachment group to which the macromolecular moiety is conjugated may be one which 
is present in the parent polypeptide, e.g. wtGCB, or may be one, which has been introduced into the 
amino acid sequence thereof and is thus not present in the parent. Thereby, the polypeptide is boosted or
otherwise altered in the content of the specific amino acid residues to which the macromolecular
moiety of choice binds, whereby a more efficient, specific and/or extensive conjugation is achieved.
For instance, when the total number of amino acid residues comprising an attachment group for the
macromolecular moiety of choice is increased, a greater proportion of the polypeptide molecule is
shielded and thus a lower immune response will result. In most cases the introduction of an amino
acid residue will be by way of substitution of an amino acid residue.

The position into which an amino acid residue comprising an attachment group is to be 
introduced is as described above for introduction of an in vitro glycosylation site. The amino acid 
residue comprising an attachment group for the macromolecular moiety is selected on the basis of
the nature of the macromolecular moiety of choice and, in most instances, on the basis of the
chemistry to be used for achieving the conjugation between the
polypeptide and the macromolecular moiety. For instance, when the macromolecular moiety is a
polymer molecule such as a polyethylene glycol or polyalkylene oxide derived molecule, an amino
acid residue comprising a suitable attachment group is normally selected from the group consisting
of lysine, cysteine, aspartic acid, glutamic acid and arginine. When conjugation to a lysine residue is
to be achieved, a suitable activated molecule is, e.g., mPEG-SPA, mPEG-SCM, mPEG-BTC from
Shearwater Polymers, Inc., SC-PEG from Enzon, Inc., tresylated mPEG as described in US
5,880,255, or oxycarbonyl-oxy-N-dicarboxyimide-PEG (US 5,122,614).

Preferably, the amino acid residue comprising an attachment group for the macromolecular 
moiety of choice is introduced into a position exposed on the surface of the parent polypeptide, in
particular into a position which in the parent polypeptide is occupied by a charged residue such as an
arginine, histidine, lysine, glutamic acid and/or aspartic acid residue or a position located between
-4 and +4 amino acid residues from such charged amino acid residue.
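
The selection of candidate positions relative to charged residues can be illustrated by the following minimal sketch. It assumes that the window is meant as four residues on either side of a charged residue in the primary sequence, it treats R, H, K, D and E as the charged residues listed above, and it does not assess surface exposure, which would require structural information. The function name and the toy sequence are hypothetical.

```python
CHARGED = frozenset("RHKDE")


def positions_near_charged(seq: str, window: int = 4) -> list[int]:
    """1-based positions occupied by, or within `window` residues of, a charged residue."""
    near: set[int] = set()
    for index, residue in enumerate(seq, start=1):
        if residue in CHARGED:
            first = max(1, index - window)
            last = min(len(seq), index + window)
            near.update(range(first, last + 1))
    return sorted(near)


# Hypothetical toy sequence with a single lysine in position 7.
toy_sequence = "GGGGGGKGGGGGGGG"
candidates = positions_near_charged(toy_sequence)
```

For this toy sequence the candidate positions are 3 to 11, i.e. the lysine in position 7 together with the four residues on either side of it.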

For instance, when lysine comprises the attachment group, modification of a parent GCB 
polypeptide may be achieved as described for introduction and/or removal of in vitro glycosylation
sites in GCB (section entitled "A GCB polypeptide of the invention").

In a further embodiment, the polypeptide of the invention is one, wherein at least one amino
acid residue comprising an attachment group for a macromolecular moiety has been removed (as 
compared to the parent GCB). By removing one or more amino acid residues comprising an 
attachment group for a macromolecular moiety of choice it is possible to avoid conjugation to the 
macromolecular moiety in parts of the polypeptide in which such conjugation is disadvantageous, 
e.g. in an amino acid residue located at or near a functional site of the polypeptide. In particular in case
of a polypeptide of the invention comprising one or more additional glycosylation sites, one or more 
amino acid residues comprising an attachment group for the non-oligosaccharide macromolecular 
moiety may be removed, if located at or within 4 amino acid residues of an O- or N-glycosylation 
site (in the primary sequence), since conjugation at such a site may result in inactivation or reduced 
activity of the resulting conjugate due to impaired receptor recognition. 
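
The identification of attachment residues lying at or within 4 amino acid residues of a glycosylation site can be sketched as follows. The lysine positions are a subset of those listed for SEQ ID NO 1 in the section on GCB, whereas the glycosylation site position is purely hypothetical and chosen for illustration; the function name is likewise an assumption.

```python
def residues_near_sites(attachment_positions: list[int],
                        glycosylation_positions: list[int],
                        max_distance: int = 4) -> list[int]:
    """Attachment residues at or within `max_distance` residues of any glycosylation site."""
    return sorted(
        position for position in attachment_positions
        if any(abs(position - site) <= max_distance
               for site in glycosylation_positions)
    )


lysine_positions = [7, 74, 77, 79, 106]  # subset of the lysine positions listed for GCB
hypothetical_sites = [80]                # illustrative glycosylation site, not from the text
flagged = residues_near_sites(lysine_positions, hypothetical_sites)
```

With these inputs the lysines in positions 77 and 79 would be flagged as candidates for removal, while the lysine in position 74, six residues away, would not.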

In a further embodiment the polypeptide of the invention differs from a parent polypeptide, 
e.g. GCB, in that at least one amino acid residue comprising an attachment group for a 
macromolecular moiety has been introduced into the sequence and at least one amino acid residue 
comprising an attachment group for the same macromolecular moiety and present in the parent 
polypeptide has been removed from the sequence. This embodiment is considered of particular 
interest for increasing the serum and/or functional in vivo half-life of a polypeptide of the invention 
and/or for shielding of epitopes, whether present in the wildtype molecule or, more likely, introduced
by amino acid or glycosylation modifications of the wildtype molecule. For instance, by introducing
and removing selected amino acid residues it is possible to ensure an optimal distribution of sites
capable of attaching the macromolecular moiety of choice, which gives rise to a conjugated
polypeptide in which the macromolecular moieties are placed so as to effectively shield epitopes and
other surface parts of the polypeptide without causing too much structural disruption and thereby
impairing the function of the polypeptide.

As indicated above the non-oligosaccharide macromolecular moiety of the conjugated 
polypeptide according to this embodiment of the invention is preferably a polymer molecule. It may 
confer desirable properties to the polypeptide, in particular increased functional in vivo half-life 
and/or increased serum half-life, and/or reduced immunogenicity and/or reduced susceptibility to 
proteolytic degradation. 

The polymer molecule to be coupled to the polypeptide may be any suitable polymer molecule, 
such as a natural or synthetic homo-polymer or heteropolymer, typically with a molecular weight in the 
range of 300-100,000 Da, such as 300-20,000 Da, more preferably in the range of 500-10,000 Da, even 
more preferably in the range of 500-5000 Da. Examples of homo-polymers include a polyol (i.e. poly-OH),
a polyamine (i.e. poly-NH2) and a polycarboxylic acid (i.e. poly-COOH). A hetero-polymer is a
polymer which comprises one or more different coupling groups, such as, e.g., a hydroxyl group and an
amine group.

Examples of suitable polymer molecules include polymer molecules selected from the 
group consisting of polyalkylene oxide (PAO), including polyalkylene glycol (PAG), such as 
polyethylene glycol (PEG) and polypropylene glycol (PPG), branched PEGs, poly-vinyl alcohol (PVA),
poly-carboxylate, poly-(vinylpyrrolidone), polyethylene-co-maleic acid anhydride, polystyrene-co-maleic
acid anhydride, dextran including carboxymethyl-dextran, or any other biopolymer suitable for reducing
immunogenicity and/or increasing functional in vivo half-life and/or serum half-life. Another example
of a polymer molecule is human albumin or another abundant plasma protein. Generally, polyalkylene
glycol-derived polymers are biocompatible, non-toxic, non-antigenic, non-immunogenic, have
various water solubility properties, and are easily excreted from living organisms.

PEG is the preferred polymer molecule to be used, since it has only few reactive groups 
capable of cross-linking compared, e.g., to polysaccharides such as dextran, and the like. In particular,
monofunctional PEG, e.g. methoxypolyethylene glycol (mPEG), is of interest since its coupling
chemistry is relatively simple (only one reactive group is available for conjugating with attachment
groups on the polypeptide). Consequently, the risk of cross-linking is eliminated, the resulting 
polypeptide conjugates are more homogeneous and the reaction of the polymer molecules with the 
polypeptide is easier to control. 

To effect covalent attachment of the polymer molecule(s) to the polypeptide, the
hydroxyl end groups of the polymer molecule must be provided in activated form, i.e. with reactive
functional groups. Suitably activated polymer molecules are commercially available, e.g. from 
Shearwater Polymers, Inc., Huntsville, AL, USA. Alternatively, the polymer molecules can be 
activated by conventional methods known in the art, e.g. as disclosed in WO 90/13540. Specific 
examples of activated linear or branched polymer molecules for use in the present invention are 

25 described in the Shearwater Polymers, Inc. 1997 and 2000 Catalogs (Functionalized Biocompatible 
Polymers for Research and pharmaceuticals, Polyethylene Glycol and Derivatives, incorporated 
herein by reference). Specific examples of activated PEG polymers include the following linear 
PEGs: NHS-PEG (e.g. SPA-PEG, SSPA-PEG, SBA-PEG, SS-PEG, SSA-PEG, SC-PEG, SG-PEG,
and SCM-PEG), NOR-PEG, BTC-PEG, EPOX-PEG, NCO-PEG, NPC-PEG, CDI-PEG, ALD-PEG,
TRES-PEG, VS-PEG, IODO-PEG, and MAL-PEG, and branched PEGs such as PEG2-NHS and
those disclosed in US 5,932,462 and US 5,643,575, both of which are incorporated herein by
reference. Furthermore, the following publications, incorporated herein by reference, disclose useful
polymer molecules and/or PEGylation chemistries: US 5,824,778, US 5,476,653, WO 97/32607, EP
229,108, EP 402,378, US 4,902,502, US 5,281,698, US 5,122,614, US 5,219,564, WO 92/16555,
WO 94/04193, WO 94/14758, WO 94/17039, WO 94/18247, WO 94/28024, WO 95/00162, WO
95/11924, WO 95/13090, WO 95/33490, WO 96/00080, WO 97/18832, WO 98/41562, WO
98/48837, WO 99/32134, WO 99/32139, WO 99/32140, WO 96/40791, WO 98/32466, WO
95/06058, EP 439 508, WO 97/03106, WO 96/21469, WO 95/13312, EP 921 131, US 5,736,625,
WO 98/05363, EP 809 996, US 5,629,384, WO 96/41813, WO 96/07670, US 5,473,034, US
5,516,673, EP 605 963, US 5,382,657, EP 510 356, EP 400 472, EP 183 503 and EP 154 316.

The conjugation of the polypeptide and the activated polymer molecules is conducted by
use of any conventional method, e.g. as described in the following references (which also describe
suitable methods for activation of polymer molecules): R.F. Taylor, (1991), "Protein immobilisation.
Fundamentals and applications", Marcel Dekker, N.Y.; S.S. Wong, (1992), "Chemistry of Protein
Conjugation and Crosslinking", CRC Press, Boca Raton; G.T. Hermanson et al., (1993), "Immobilized
Affinity Ligand Techniques", Academic Press, N.Y. The skilled person will be aware that the
activation method and/or conjugation chemistry to be used depends on the attachment group(s) of the
polypeptide as well as the functional groups of the polymer (e.g. being amino, hydroxyl, carboxyl,
aldehyde or sulfhydryl). The PEGylation may be directed towards conjugation to all available
attachment groups on the polypeptide (i.e. such attachment groups that are exposed at the surface of
the polypeptide) or may be directed towards specific attachment groups, e.g. the N-terminal amino
group (US 5,985,265). Furthermore, the conjugation may be achieved in one step or in a stepwise
manner (e.g. as described in WO 99/55377).

It will be understood that the PEGylation is designed so as to produce the optimal
molecule with respect to the number of PEG molecules attached, the size and form (e.g. whether
they are linear or branched) of such molecules, and where in the polypeptide such molecules are
attached. For instance, the molecular weight of the polymer to be used may be chosen on the basis of
the desired effect to be achieved. For instance, if the primary purpose of the conjugation is to achieve a
conjugate having a high molecular weight (e.g. to reduce renal clearance and thereby increase the serum
and/or functional in vivo half-life) it is usually desirable to conjugate as few high Mw polymer
molecules as possible to obtain the desired molecular weight. When a high degree of epitope or
proteolytic site shielding is desirable this may be obtained by use of a sufficiently high number of low
molecular weight polymers (e.g. with a molecular weight of about 5,000 Da) to effectively shield all or
most epitopes of the polypeptide. For instance, 1-8, such as 1-4 such polymers may be used.
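
The trade-off between few large and many small polymer molecules can be illustrated by a short
calculation. The following Python sketch is illustrative only: it assumes that the conjugate mass is
simply the sum of the polypeptide mass and the attached polymer masses, and the 60,000 Da polypeptide
mass and 100,000 Da target are hypothetical values that do not come from this specification.

```python
import math


def conjugate_mass_da(polypeptide_da: float, polymer_da: float, n_polymers: int) -> float:
    """Approximate conjugate mass as polypeptide mass plus attached polymer mass."""
    if n_polymers < 0:
        raise ValueError("n_polymers must be non-negative")
    return polypeptide_da + n_polymers * polymer_da


def polymers_needed(polypeptide_da: float, polymer_da: float, target_da: float) -> int:
    """Smallest number of polymer molecules that reaches the target conjugate mass."""
    if polymer_da <= 0:
        raise ValueError("polymer_da must be positive")
    missing = max(0.0, target_da - polypeptide_da)
    return math.ceil(missing / polymer_da)


# Hypothetical figures: a 60,000 Da polypeptide and a 100,000 Da target conjugate.
few_large = polymers_needed(60_000, 20_000, 100_000)   # high Mw polymer
many_small = polymers_needed(60_000, 5_000, 100_000)   # low Mw polymer, about 5,000 Da
```

Under these assumptions the same target mass is reached with fewer 20,000 Da molecules than 5,000 Da
molecules, which mirrors the preference stated above for few high Mw polymers when size is the goal.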

Normally, the polymer conjugation is performed under conditions aiming at reacting all
available polymer attachment groups with polymer molecules. Typically, the molar ratio of activated
polymer molecules to polypeptide is in the range of 1000-1, in particular 200-1, preferably 100-1,
such as 10-1 or 5-1, in order to obtain optimal reaction.
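
As a worked illustration of the molar ratio, the mass of activated polymer to weigh out can be
estimated from the amount of polypeptide. The sketch below is not a protocol: the function name, the
60,000 Da polypeptide and the 5,000 Da polymer are assumptions made for the example, and a ratio such
as 100 is read here as 100 moles of activated polymer per mole of polypeptide.

```python
def activated_polymer_mass_mg(
    polypeptide_mg: float,
    polypeptide_da: float,
    polymer_da: float,
    molar_ratio: float,
) -> float:
    """Mass of activated polymer (mg) giving the requested polymer:polypeptide molar ratio."""
    if min(polypeptide_mg, polypeptide_da, polymer_da, molar_ratio) <= 0:
        raise ValueError("all arguments must be positive")
    # mg divided by g/mol gives mmol of polypeptide.
    polypeptide_mmol = polypeptide_mg / polypeptide_da
    # mmol of polymer multiplied by g/mol gives mg of polymer.
    return polypeptide_mmol * molar_ratio * polymer_da


# Hypothetical example: 6 mg of a 60,000 Da polypeptide, 5,000 Da polymer, ratio 100.
mass_mg = activated_polymer_mass_mg(6.0, 60_000, 5_000, 100)
```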

It is also contemplated according to the invention to couple the polymer molecules to the
polypeptide through a linker. Suitable linkers are well known to the skilled person. A preferred example
is cyanuric chloride (Abuchowski et al., (1977), J. Biol. Chem., 252, 3578-3581; US 4,179,337; Shafer
et al., (1986), J. Polym. Sci. Polym. Chem. Ed., 24, 375-378). Subsequent to the conjugation, residual
activated polymer molecules are blocked according to methods known in the art, e.g. by addition of
a primary amine to the reaction mixture, and the resulting inactivated polymer molecules are removed by
a suitable method.

Properties of a polypeptide of the invention 

Preferably, the polypeptide of the invention has at least one of the following properties
relative to the parent polypeptide or a reference molecule, the properties being measured under
comparable conditions:

increased in vivo activity,

in vitro bioactivity which is at least 25%, such as at least 50% or at least 75% of that of the parent 
or reference polypeptide as measured under comparable conditions, 

increased affinity for a mannose receptor, mannose-6-phosphate-receptor, or other carbohydrate 
receptors, 

increased serum or functional in vivo half-life, 

reduced renal clearance, 

reduced immunogenicity, 

increased resistance to proteolytic cleavage, 

increased targeting to and/or uptake in phagocytic cells, such as macrophages or macrophage-like
cells or a suborganelle compartment thereof (lysosomes) or other subpopulations of human cells (e.g.
muscle cells, fibroblasts, etc.) of relevance for the specific polypeptide of the invention,

improved stability in production, improved shelf life, improved formulation, e.g. liquid formulation,
improved purification, improved solubility, and/or improved expression.

Improved properties are determined by conventional methods known in the art for 
determining such properties or as described herein. 

Methods of preparing a polypeptide of the invention 

The invention further comprises a method of producing the present polypeptide comprising 
culturing a host cell transformed or transfected with a nucleotide sequence encoding the polypeptide 
under conditions permitting the expression of the polypeptide, and recovering the polypeptide from 
the culture. 

The term "nucleotide sequence" is intended to indicate a consecutive stretch of two
or more nucleotide molecules. The nucleotide sequence may be of genomic, cDNA, RNA,
semisynthetic or synthetic origin, or any combination thereof.

The terms "cell", "host cell", "cell line" and "cell culture" are used interchangeably herein
and all such terms should be understood to include progeny resulting from growth or culturing of a
cell. "Transformation" and "transfection" are used interchangeably to refer to the process of
introducing DNA into a cell.

Apart from recombinant production, polypeptides of the invention may be produced, albeit 
less efficiently, by chemical synthesis or a combination of chemical synthesis and recombinant 
DNA technology. 

The nucleotide sequence of the invention encoding a polypeptide of the invention may be 
constructed by isolating or synthesizing a nucleotide sequence encoding the relevant parent 
polypeptide (in the case of GCB for instance wt GCB with the amino acid sequence shown in SEQ 
ID NO: 1) and then changing the nucleotide sequence so as to effect introduction (i.e. insertion or 
substitution) or removal (i.e. deletion or substitution) of the relevant amino acid residue(s). The 
nucleotide sequence is conveniently modified by site-directed mutagenesis in accordance with well- 
known methods, e.g. as described in Nelson and Long, Analytical Biochemistry 180, 147-151, 1989. 
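
At the sequence level, introduction or removal of an amino acid residue by substitution, insertion or
deletion amounts to replacing, adding or deleting one codon in the coding sequence. The following
Python sketch only illustrates that bookkeeping on a text string; it is not the mutagenesis method of
the cited reference, and the function names and the short example sequence are hypothetical.

```python
def _codon_start(cds: str, residue_number: int, allow_end: bool = False) -> int:
    """Return the 0-based nucleotide index of a 1-based residue number."""
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of three")
    start = (residue_number - 1) * 3
    limit = len(cds) if allow_end else len(cds) - 3
    if start < 0 or start > limit:
        raise ValueError("residue number outside the coding sequence")
    return start


def substitute_codon(cds: str, residue_number: int, new_codon: str) -> str:
    """Replace the codon of one residue (substitution)."""
    if len(new_codon) != 3:
        raise ValueError("a codon has exactly three nucleotides")
    start = _codon_start(cds, residue_number)
    return cds[:start] + new_codon + cds[start + 3:]


def insert_codon(cds: str, residue_number: int, new_codon: str) -> str:
    """Insert a codon so that it becomes the residue at residue_number (insertion)."""
    if len(new_codon) != 3:
        raise ValueError("a codon has exactly three nucleotides")
    start = _codon_start(cds, residue_number, allow_end=True)
    return cds[:start] + new_codon + cds[start:]


def delete_codon(cds: str, residue_number: int) -> str:
    """Remove the codon of one residue (deletion)."""
    start = _codon_start(cds, residue_number)
    return cds[:start] + cds[start + 3:]


# Hypothetical three-codon sequence Met-Ala-Lys; residue 2 is changed to Asn (AAC).
mutated = substitute_codon("ATGGCCAAG", 2, "AAC")
```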

Alternatively, the nucleotide sequence may be prepared by chemical synthesis, e.g. by using 
an oligonucleotide synthesizer, wherein oligonucleotides are designed based on the amino acid 
sequence of the desired polypeptide, and preferably selecting those codons that are favoured in the 
host cell in which the recombinant polypeptide will be produced. For example, several small 
oligonucleotides coding for portions of the desired polypeptide may be synthesized and assembled 
by PCR, ligation or ligase chain reaction (LCR). The individual oligonucleotides typically contain
5' or 3' overhangs for complementary assembly. 
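
The design step described above can be sketched in a few lines of Python: the amino acid sequence is
back-translated using one favoured codon per residue, and the result is divided into overlapping
oligonucleotides. This is a simplified illustration. The codon table is a placeholder rather than
measured codon usage for any particular host, only one strand is tiled (a real design would alternate
strands and use reverse complements), and all names and lengths are assumptions.

```python
# Placeholder table: one codon per amino acid. Replace with codons favoured by the chosen host.
PREFERRED_CODON = {
    "A": "GCC", "C": "TGC", "D": "GAC", "E": "GAG", "F": "TTC",
    "G": "GGC", "H": "CAC", "I": "ATC", "K": "AAG", "L": "CTG",
    "M": "ATG", "N": "AAC", "P": "CCC", "Q": "CAG", "R": "CGG",
    "S": "AGC", "T": "ACC", "V": "GTG", "W": "TGG", "Y": "TAC",
}


def back_translate(protein: str, table: dict = PREFERRED_CODON) -> str:
    """Back-translate an amino acid sequence using one favoured codon per residue."""
    try:
        return "".join(table[aa] for aa in protein.upper())
    except KeyError as exc:
        raise ValueError(f"no codon defined for residue {exc.args[0]!r}") from None


def tile_oligos(sequence: str, oligo_len: int = 40, overlap: int = 20) -> list:
    """Split a sequence into same-strand oligonucleotides sharing `overlap` nucleotides."""
    if not 0 < overlap < oligo_len:
        raise ValueError("overlap must be positive and smaller than oligo_len")
    step = oligo_len - overlap
    oligos = []
    start = 0
    while True:
        oligos.append(sequence[start:start + oligo_len])
        if start + oligo_len >= len(sequence):
            break
        start += step
    return oligos


coding = back_translate("MKW")
pieces = tile_oligos("ABCDEFGHIJ", oligo_len=4, overlap=2)
```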

Once assembled (by synthesis, site-directed mutagenesis or another method), the nucleotide 
sequence encoding the polypeptide may be inserted into a recombinant vector and operably linked to 
control sequences necessary for expression of the polypeptide in the desired transformed host cell. 

It should of course be understood that not all vectors and expression control sequences 
function equally well to express the nucleotide sequence encoding a polypeptide of the invention. 
Neither will all hosts function equally well with the same expression system. However, one of skill 
in the art may make a selection among these vectors, expression control sequences and hosts without 
undue experimentation. For example, in selecting a vector, the host must be considered because the 
vector must replicate in it or be able to integrate into the chromosome. The vector's copy number, 
the ability to control that copy number, and the expression of any other proteins encoded by the 
vector, such as antibiotic markers, should also be considered. In selecting an expression control 
sequence, a variety of factors should also be considered. These include, for example, the relative
strength of the sequence, its controllability, and its compatibility with the nucleotide sequence
encoding the polypeptide, particularly as regards potential secondary structures. Hosts should be
selected by consideration of their compatibility with the chosen vector, the toxicity of the product
coded for by the nucleotide sequence, their secretion characteristics, their ability to fold the 
polypeptide correctly, their fermentation or culture requirements, and the ease of purification of the 
products coded for by the nucleotide sequence. 

The recombinant vector may be an autonomously replicating vector, i.e. a vector which
exists as an extrachromosomal entity, the replication of which is independent of chromosomal
replication, e.g. a plasmid. Alternatively, the vector is one which, when introduced into a host cell, is
integrated into the host cell genome and replicated together with the chromosome(s) into which it
has been integrated.

The vector is preferably an expression vector, in which the nucleotide sequence encoding
the polypeptide of the invention is operably linked to additional segments required for transcription
of the nucleotide sequence. The vector is typically derived from plasmid or viral DNA. A number of
suitable expression vectors for expression in the host cells mentioned herein are commercially
available or described in the literature. Useful expression vectors for eukaryotic hosts include, for
example, vectors comprising expression control sequences from SV40, bovine papilloma virus,
adenovirus and cytomegalovirus. Specific vectors are, e.g., pCDNA3.1(+)/Hyg (Invitrogen,
Carlsbad, CA, USA) and pCI-neo (Stratagene, La Jolla, CA, USA). Useful expression vectors for
yeast cells include the 2µ plasmid and derivatives thereof, the POT1 vector (US 4,931,373), the
pJS037 vector described in Okkels, Ann. New York Acad. Sci. 782, 202-207, 1996, and pPICZ A,
B or C (Invitrogen, Carlsbad, CA, USA). Useful vectors for insect cells include pVL941, pBG311
(Cate et al., "Isolation of the Bovine and Human Genes for Mullerian Inhibiting Substance And
Expression of the Human Gene In Animal Cells", Cell, 45, pp. 685-98 (1986)), pBluebac 4.5 and
pMelbac (both available from Invitrogen, Carlsbad, CA, USA).

Other vectors for use in this invention include those that allow the nucleotide sequence
encoding the polypeptide to be amplified in copy number. Such amplifiable vectors are well known
in the art. They include, for example, vectors able to be amplified by DHFR amplification (see, e.g.,
Kaufman, U.S. Pat. No. 4,470,461, Kaufman and Sharp, "Construction Of A Modular Dihydrofolate
Reductase cDNA Gene: Analysis Of Signals Utilized For Efficient Expression", Mol. Cell. Biol., 2,
pp. 1304-19 (1982)) and glutamine synthetase ("GS") amplification (see, e.g., US 5,122,464 and EP
338,841).

The recombinant vector may further comprise a DNA sequence enabling the vector to
replicate in the host cell in question. An example of such a sequence (when the host cell is a
mammalian cell) is the SV40 origin of replication. When the host cell is a yeast cell, suitable
sequences enabling the vector to replicate are the yeast plasmid 2µ replication genes REP 1-3 and
origin of replication.

The vector may also comprise a selectable marker, e.g. a gene the product of which
complements a defect in the host cell, such as the gene coding for dihydrofolate reductase (DHFR)
or the Schizosaccharomyces pombe TPI gene (described by P.R. Russell, Gene 40, 1985, pp. 125-130),
or one which confers resistance to a drug, e.g. ampicillin, kanamycin, tetracycline,
chloramphenicol, neomycin, hygromycin or methotrexate. For filamentous fungi, selectable markers
include amdS, pyrG, argB, niaD and sC.

The term "control sequences" is defined herein to include all components that are
necessary or advantageous for the expression of the polypeptide of the invention. Each control
sequence may be native or foreign to the nucleic acid sequence encoding the polypeptide. Such
control sequences include, but are not limited to, a leader, polyadenylation sequence, propeptide
sequence, promoter, enhancer or upstream activating sequence, signal peptide sequence, and
transcription terminator. At a minimum, the control sequences include a promoter operably linked to
the nucleotide sequence encoding the polypeptide.

"Operably linked" refers to the covalent joining of two or more nucleotide sequences, by 
means of enzymatic ligation or otherwise, in a configuration relative to one another such that the
normal function of the sequences can be performed. For example, the nucleotide sequence encoding
a presequence or secretory leader is operably linked to a nucleotide sequence for a polypeptide if it
is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or
enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; a
ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate
translation. Generally, "operably linked" means that the nucleotide sequences being linked are
contiguous and, in the case of a secretory leader, contiguous and in reading phase. Linking is
accomplished by ligation at convenient restriction sites. If such sites do not exist, then synthetic
oligonucleotide adaptors or linkers are used, in conjunction with standard recombinant DNA
methods.

A wide variety of expression control sequences may be used in the present invention. Such
useful expression control sequences include the expression control sequences associated with
structural genes of the foregoing expression vectors as well as any sequence known to control the
expression of genes of prokaryotic or eukaryotic cells or their viruses, and various combinations
thereof.

Examples of suitable control sequences for directing transcription in mammalian cells
include the early and late promoters of SV40 and adenovirus, e.g. the adenovirus 2 major late
promoter, the MT-1 (metallothionein gene) promoter, the human cytomegalovirus immediate-early
gene promoter (CMV), the human elongation factor 1α (EF-1α) promoter, the Drosophila minimal
heat shock protein 70 promoter, the Rous Sarcoma Virus (RSV) promoter, the human ubiquitin C
(UbC) promoter, the human growth hormone terminator, SV40 or adenovirus E1b region
polyadenylation signals and the Kozak consensus sequence (Kozak, M. J Mol Biol 1987 Aug
20;196(4):947-50).

In order to improve expression in mammalian cells a synthetic intron may be inserted in the
5' untranslated region of the nucleotide sequence encoding the polypeptide of the invention. An
example of a synthetic intron is the synthetic intron from the plasmid pCI-Neo (available from
Promega Corporation, WI, USA).

Examples of suitable control sequences for directing transcription in insect cells include the
polyhedrin promoter, the P10 promoter, the Autographa californica polyhedrosis virus basic protein
promoter, the baculovirus immediate early gene 1 promoter and the baculovirus 39K delayed-early
gene promoter, and the SV40 polyadenylation sequence.

Examples of suitable control sequences for use in yeast host cells include the promoters of
the yeast α-mating system, the yeast triose phosphate isomerase (TPI) promoter, promoters from
yeast glycolytic genes or alcohol dehydrogenase genes, the ADH2-4c promoter and the inducible
GAL promoter.

Examples of suitable control sequences for use in filamentous fungal host cells include the
ADH3 promoter and terminator, a promoter derived from the genes encoding Aspergillus oryzae
TAKA amylase, triose phosphate isomerase or alkaline protease, an A. niger α-amylase, A. niger or
A. nidulans glucoamylase, A. nidulans acetamidase, Rhizomucor miehei aspartic proteinase or
lipase, the TPI1 terminator and the ADH3 terminator.

The nucleotide sequence of the invention encoding a GCB polypeptide, whether prepared by
site-directed mutagenesis, synthesis or other methods, may or may not also include a nucleotide
sequence that encodes a signal peptide. The signal peptide is present when the polypeptide is to be
secreted from the cells in which it is expressed. Such signal peptide, if present, should be one
recognized by the cell chosen for expression of the polypeptide. The signal peptide may be
homologous (e.g. be that normally associated with human GCB) or heterologous (i.e. originating
from another source than human GCB) to the polypeptide or may be homologous or heterologous to
the host cell, i.e. a signal peptide normally expressed from the host cell or one which is not normally
expressed from the host cell. Accordingly, the signal peptide may be prokaryotic, e.g. derived from a
bacterium, or eukaryotic, e.g. derived from a mammalian, insect, filamentous fungal or yeast cell.

The presence or absence of a signal peptide will, e.g., depend on the expression host cell
used for the production of the polypeptide, the protein to be expressed (whether it is an intracellular
or extracellular protein) and whether it is desirable to obtain secretion. For use in filamentous fungi,
the signal peptide may conveniently be derived from a gene encoding an Aspergillus sp. amylase or
glucoamylase, a gene encoding a Rhizomucor miehei lipase or protease or a Humicola lanuginosa
lipase. The signal peptide is preferably derived from a gene encoding A. oryzae TAKA amylase, A.
niger neutral α-amylase, A. niger acid-stable amylase, or A. niger glucoamylase. For use in insect
cells, the signal peptide may conveniently be derived from an insect gene (cf. WO 90/05783), such
as the lepidopteran Manduca sexta adipokinetic hormone precursor (cf. US 5,023,328), the
honeybee melittin (Invitrogen, Carlsbad, CA, USA), ecdysteroid UDP glucosyltransferase (egt)
(Murphy et al., Protein Expression and Purification 4, 349-357 (1993)) or human pancreatic lipase
(hpl) (Methods in Enzymology 284, pp. 262-272, 1997).

A preferred signal peptide for use in mammalian cells is that of human GCB (apparent from
the examples hereinafter when the polypeptide is a GCB polypeptide) or the murine Ig kappa light
chain signal peptide (Coloma, M (1992) J. Imm. Methods 152:89-104). For use in yeast cells
suitable signal peptides have been found to be the α-factor signal peptide from S. cerevisiae (cf. US
4,870,008), the signal peptide of mouse salivary amylase (cf. O. Hagenbuchle et al., Nature
289, 1981, pp. 643-646), a modified carboxypeptidase signal peptide (cf. L.A. Valls et al., Cell 48,
1987, pp. 887-897), the yeast BAR1 signal peptide (cf. WO 87/02670), and the yeast aspartic
protease 3 (YAP3) signal peptide (cf. M. Egel-Mitani et al., Yeast 6, 1990, pp. 127-137).

Any suitable host may be used to produce the polypeptide of the invention, including
bacteria, fungi (including yeasts), plant, insect, mammal, or other appropriate animal cells or cell
lines, as well as transgenic animals or plants. When a non-glycosylating organism such as E. coli is
used, and the polypeptide of the invention is to be a glycosylated polypeptide, the expression in
E. coli is preferably followed by suitable in vitro glycosylation.

Examples of bacterial host cells include gram-positive bacteria such as strains of Bacillus,
e.g. B. brevis or B. subtilis, Pseudomonas or Streptomyces, or gram-negative bacteria, such as strains
of E. coli. The introduction of a vector into a bacterial host cell may, for instance, be effected by
protoplast transformation (see, e.g., Chang and Cohen, 1979, Molecular General Genetics 168: 111-115),
using competent cells (see, e.g., Young and Spizizen, 1961, Journal of Bacteriology 81: 823-829,
or Dubnau and Davidoff-Abelson, 1971, Journal of Molecular Biology 56: 209-221),
electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechniques 6: 742-751), or conjugation
(see, e.g., Koehler and Thorne, 1987, Journal of Bacteriology 169: 5271-5278).

Examples of suitable filamentous fungal host cells include strains of Aspergillus, e.g. A.
oryzae, A. niger, or A. nidulans, Fusarium or Trichoderma. Fungal cells may be transformed by a
process involving protoplast formation, transformation of the protoplasts, and regeneration of the
cell wall in a manner known per se. Suitable procedures for transformation of Aspergillus host cells
are described in EP 238 023 and US 5,679,543. Suitable methods for transforming Fusarium species
are described by Malardier et al., 1989, Gene 78: 147-156 and WO 96/00787. Yeast may be
transformed using the procedures described by Becker and Guarente, In Abelson, J.N. and Simon,
M.I., editors, Guide to Yeast Genetics and Molecular Biology, Methods in Enzymology, Volume
194, pp 182-187, Academic Press, Inc., New York; Ito et al., 1983, Journal of Bacteriology 153:
163; and Hinnen et al., 1978, Proceedings of the National Academy of Sciences USA 75: 1920.

The host cell is preferably selected from a group of host cells capable of generating the
desired glycosylation of the polypeptide for improved lysosomal activity. Thus, the host cell may
advantageously be selected from a yeast cell, insect cell, or mammalian cell.

Examples of suitable yeast host cells include strains of Saccharomyces, e.g. S. cerevisiae,
Schizosaccharomyces, Kluyveromyces, Pichia, such as P. pastoris or P. methanolica, Hansenula,
such as H. polymorpha, or Yarrowia. Of particular interest are yeast glycosylation mutant cells, e.g.
derived from S. cerevisiae, P. pastoris or Hansenula spp. (e.g. the S. cerevisiae glycosylation
mutants och1, och1 mnn1 or och1 mnn1 alg3 described by Nagasu et al. Yeast 8, 535-547, 1992
and Nakanishi-Shindo et al. J. Biol. Chem. 268, 26338-26345, 1993). Methods for transforming
yeast cells with heterologous DNA and producing heterologous polypeptides therefrom are
disclosed by Clontech Laboratories, Inc., Palo Alto, CA, USA (in the product protocol for the
Yeastmaker™ Yeast Transformation System Kit), and by Reeves et al., FEMS Microbiology Letters
99 (1992) 193-198, Manivasakam and Schiestl, Nucleic Acids Research, 1993, Vol. 21, No. 18, pp.
4414-4415 and Ganeva et al., FEMS Microbiology Letters 121 (1994) 159-164.

Examples of suitable insect host cells include a Lepidoptera cell line, such as Spodoptera
frugiperda (Sf9 or Sf21) or Trichoplusia ni cells (High Five) (US 5,077,214). Transformation of
insect cells and production of heterologous polypeptides therein may be performed as described by
Invitrogen, Carlsbad, CA, USA.

Examples of suitable mammalian host cells include Chinese hamster ovary (CHO) cell lines (e.g.
CHO-K1; ATCC CCL-61), Green Monkey cell lines (COS) (e.g. COS 1 (ATCC CRL-1650), COS 7
(ATCC CRL-1651)), mouse cells (e.g. NS/0), Baby Hamster Kidney (BHK) cell lines (e.g. ATCC
CRL-1632 or ATCC CCL-10), and human cells (e.g. HEK 293 (ATCC CRL-1573)), as well as plant
cells in tissue culture. Additional suitable cell lines are known in the art and available from public
depositories such as the American Type Culture Collection, Rockville, Maryland. Of particular
interest for the present purpose is a mammalian glycosylation mutant cell line, such as CHO-LEC1,
CHO-LEC2 or CHO-LEC18 (CHO-LEC1: Stanley et al. Proc. Natl. Acad. Sci. USA 72, 3323-3327,
1975 and Grossmann et al., J. Biol. Chem. 270, 29378-29385, 1995; CHO-LEC18: Raju et al.
J. Biol. Chem. 270, 30294-30302, 1995).

In a specific aspect the invention relates to a glycosylation mutant derived from yeast, e.g.
Saccharomyces cerevisiae, Pichia pastoris or Hansenula spp., or a mammalian glycosylation mutant
cell line as mentioned above, comprising a heterologous nucleotide sequence encoding a lysosomal
enzyme or a lysosomal enzyme activator, in particular a GCB polypeptide. The lysosomal enzyme may
be a wt enzyme or a polypeptide as described in the present invention. Likewise the activator may be
a wt activator or a polypeptide as described herein. The mammalian glycosylation mutant cell line is
preferably CHO-LEC1.

Methods for introducing exogenous DNA into mammalian host cells include calcium
phosphate-mediated transfection, electroporation, DEAE-dextran mediated transfection, liposome-mediated
transfection, viral vectors and the transfection method described by Life Technologies Ltd,
Paisley, UK using Lipofectamine 2000. These methods are well known in the art and e.g. described
by Ausubel et al. (eds.), 1996, Current Protocols in Molecular Biology, John Wiley & Sons, New
York, USA. The cultivation of mammalian cells is conducted according to established methods,
e.g. as disclosed in Animal Cell Biotechnology, Methods and Protocols, edited by Nigel Jenkins,
1999, Humana Press Inc., Totowa, New Jersey, USA, and Harrison MA and Rae IF, General
Techniques of Cell Culture, Cambridge University Press 1997.

In the production methods of the present invention, the cells are cultivated in a nutrient 
medium suitable for production of the polypeptide using methods known in the art. For example, 
the cell may be cultivated by shake flask cultivation, small-scale or large-scale fermentation
(including continuous, batch, fed-batch, or solid state fermentations) in laboratory or industrial 
fermenters performed in a suitable medium and under conditions allowing the polypeptide to be 
expressed and/or isolated. The cultivation takes place in a suitable nutrient medium comprising 
carbon and nitrogen sources and inorganic salts, using procedures known in the art. Suitable media 
are available from commercial suppliers or may be prepared according to published compositions 
(e.g., in catalogues of the American Type Culture Collection). If the polypeptide is secreted into the 
nutrient medium, the polypeptide can be recovered directly from the medium. If the polypeptide is 
not secreted, it can be recovered from cell lysates.

The resulting polypeptide may be recovered by methods known in the art. For example, the 
polypeptide may be recovered from the nutrient medium by conventional procedures including, but 
not limited to, centrifugation, filtration, extraction, spray drying, evaporation, or precipitation. 

The polypeptides may be purified by a variety of procedures known in the art including, but
not limited to, chromatography (e.g., ion exchange, affinity, hydrophobic, chromatofocusing, and
size exclusion), electrophoretic procedures (e.g., preparative isoelectric focusing), differential
solubility (e.g., ammonium sulfate precipitation), SDS-PAGE, or extraction (see, e.g., Protein
Purification, J-C Janson and Lars Ryden, editors, VCH Publishers, New York, 1989). Specific
methods for purifying GCB polypeptides are disclosed in US 5,236,838 and Osiecki-Newman et al.,
Enzyme 35, 147-153, 1986.

Other Methods of the invention

Introduction of glycosylation sites

While glycosylation sites (or other attachment groups as described herein) can be introduced by a
strictly directed approach (e.g. based on site-directed mutagenesis), it is also possible to use a
random approach based on random mutagenesis, recombination, shuffling, or any other technology.
For instance, a nucleotide sequence encoding a polypeptide of the invention (optionally including an
N- or C-terminal peptide addition and/or being a chimeric polypeptide) can be constructed from two
or more nucleotide sequences encoding the polypeptide, the sequences being sufficiently
homologous to allow recombination between the sequences, in particular in the part thereof where
the glycosylation site or other attachment group (or peptide addition) is to be introduced. The
combination of nucleotide sequences or sequence parts is conveniently conducted by methods
known in the art, for instance methods which involve homologous cross-over such as disclosed in
US 5,093,257, or methods which involve gene shuffling, i.e., recombination between two or more
homologous nucleotide sequences resulting in new nucleotide sequences having a number of
nucleotide alterations when compared to the starting nucleotide sequences. In order for homology-based
nucleic acid shuffling to take place the relevant parts of the nucleotide sequences are
preferably at least 50% identical, such as at least 60% identical, more preferably at least 70%
identical, such as at least 80% identical. The recombination can be performed in vitro or in vivo.
Examples of suitable in vitro gene shuffling methods are disclosed by Stemmer et al. (1994), Proc.
Natl. Acad. Sci. USA, vol. 91, pp. 10747-10751; Stemmer (1994), Nature, vol. 370, pp. 389-391;
Smith (1994), Nature, vol. 370, pp. 324-325; Zhao et al., Nat. Biotechnol. 1998, Mar; 16(3): 258-61;
Zhao H. and Arnold, F.H., Nucleic Acids Research, 1997, Vol. 25, No. 6, pp. 1307-1308; Shao et al.,
Nucleic Acids Research 1998, Jan 15; 26(2): pp. 681-83; and WO 95/17413. An example of a suitable
in vivo shuffling method is disclosed in WO 97/07205.
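
The identity thresholds mentioned above can be checked with a simple calculation once the relevant
sequence parts have been aligned. The Python sketch below is an illustration under stated
assumptions: the two sequences are taken to be already aligned and of equal length, identity is
counted position by position, and the function names and example sequences are hypothetical.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical positions in two pre-aligned, equal-length sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def meets_shuffling_threshold(seq_a: str, seq_b: str, threshold: float = 50.0) -> bool:
    """True if identity reaches the chosen threshold (e.g. 50, 60, 70 or 80 percent)."""
    return percent_identity(seq_a, seq_b) >= threshold


# Hypothetical aligned fragments differing at one of four positions.
identity = percent_identity("ATGC", "ATGA")
```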

Furthermore, a nucleotide sequence encoding a polypeptide of the invention can be 
constructed by preparing a randomly mutagenized library, conveniently prepared by subjecting a 
nucleotide sequence encoding the polypeptide (or, when relevant, the peptide addition) to random 
mutagenesis to create a large number of mutated nucleotide sequences. While the random
mutagenesis can be entirely random, both with respect to where in the nucleotide sequence the
mutagenesis occurs and with respect to the nature of mutagenesis, it is preferably conducted so as to
randomly mutate only the part of the sequence in which a glycosylation site or other attachment
group is to be introduced or the part encoding the peptide addition. The random mutagenesis can be
directed towards introducing certain types of amino acid residues, in particular amino acid residues
containing a glycosylation site or other attachment group, at random into the polypeptide molecule
or at random into a peptide addition part thereof. Besides substitutions, random mutagenesis can
also cover random introduction of insertions or deletions. Preferably, the insertions are made in
reading frame, e.g., by performing multiple introduction of three nucleotides as described by Hallet
et al., Nucleic Acids Res. 1997, 25(9): 1866-7 and Sondek and Shortle, Proc. Natl. Acad. Sci. USA
1992, 89(8): 3581-5.

The random mutagenesis (either of the whole nucleotide sequence or a part thereof, 
e.g. the part encoding the peptide addition) can be performed by any suitable method. For example, 
the random mutagenesis is performed using a suitable physical or chemical mutagenizing agent, a 
suitable oligonucleotide, PCR generated mutagenesis or any combination of these mutagenizing 
agents and/or other methods according to state of the art technology, e.g. as disclosed in WO
97/07202. 

Error-prone PCR generated mutagenesis, e.g. as described by J.O. Deshler (1992),
GATA 9(4): 103-106 and Leung et al., Technique (1989) Vol. 1, No. 1, pp. 11-15, is particularly
useful for mutagenesis of longer peptide stretches (corresponding to nucleotide sequences
containing more than 100 bp) or entire genes, and is preferably performed under conditions that
increase the misincorporation of nucleotides.

Random mutagenesis based on doped or spiked oligonucleotides or on specific
sequence oligonucleotides is of particular use for mutagenesis of the part of the nucleotide sequence
encoding the peptide addition. 

Random mutagenesis of the part of the nucleotide sequence encoding the peptide 
addition can be performed using PCR generated mutagenesis, in which one or more suitable 
oligonucleotide primers flanking the area to be mutagenized are used. In addition, doping or spiking 
with oligonucleotides can be used to introduce mutations so as to remove or introduce glycosylation 
sites. State of the art knowledge and computer programs (e.g. as described by Siderovski DP and
Mak TW, Comput. Biol. Med. (1993) Vol. 23, No. 6, pp. 463-474 and Jensen et al., Nucleic Acids
Research, 1998, Vol. 26, No. 3) can be used for calculating the optimal nucleotide mixture for
a given amino acid preference. The oligonucleotides can be incorporated into the nucleotide
sequence encoding the peptide addition by any published technique using e.g. PCR, LCR or any 
DNA polymerase or ligase. 
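
The effect of a doped codon on the encoded residues can be illustrated by enumerating the amino acids that a degenerate codon covers. The following is a minimal sketch under stated assumptions: it uses the IUPAC nucleotide ambiguity codes and the standard genetic code, it does not reproduce the cited programs, and the names expand_codon and encoded_residues are illustrative only.

```python
from itertools import product

# IUPAC nucleotide ambiguity codes
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Standard genetic code with bases ordered T, C, A, G at each codon position
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: _AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(_BASES)
    for j, b2 in enumerate(_BASES)
    for k, b3 in enumerate(_BASES)
}


def expand_codon(degenerate):
    """Return every concrete codon represented by a three-letter degenerate codon."""
    if len(degenerate) != 3:
        raise ValueError("a codon has exactly three positions")
    choices = [IUPAC[base] for base in degenerate.upper()]
    return ["".join(codon) for codon in product(*choices)]


def encoded_residues(degenerate):
    """Return the set of amino acids ('*' denotes a stop codon) that the codon encodes."""
    return {CODON_TABLE[codon] for codon in expand_codon(degenerate)}
```

Under these assumptions a doped N-X-T/S site could, for example, be written as AAY (asparagine only), followed by a degenerate codon for X, followed by WCN (serine or threonine). Whether proline or stop codons are excluded at X depends on the mixture chosen for that position.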

According to a convenient PCR method the nucleotide sequence encoding the 
polypeptide of the invention or, e.g., a peptide addition thereof, is used as a template and, e.g., 
doped or specific oligonucleotides are used as primers. In addition, cloning primers localized outside
the targeted region can be used. The resulting PCR product can either be cloned directly into an
appropriate expression vector or gel purified and amplified in a second PCR reaction using the
cloning primers and cloned into an appropriate expression vector.

In addition to the random mutagenesis methods described herein, it is occasionally useful to
employ site specific mutagenesis techniques to modify one or more selected amino acids in the
polypeptide, in particular to optimise the polypeptide with respect to the number of glycosylation
sites.

Furthermore, random elongation mutagenesis as described by Matsuura et al, Nature 
Biotechnology, 1999, Vol. 17, 58-61, can be used to construct a nucleotide sequence encoding the 
polypeptide of the invention having a C-terminal peptide addition. A nucleotide
sequence encoding the polypeptide of the invention having an N-terminal peptide addition can be
constructed in an analogous way.

Also, the methods disclosed in WO 97/04079, the contents of which are incorporated herein 
by reference, can be used for constructing a nucleotide sequence encoding a polypeptide of the 
invention. 

The nucleotide sequence(s) or nucleotide sequence region(s) to be mutagenized is typically 
present on a suitable vector such as a plasmid or a bacteriophage, which as such is incubated with or
otherwise exposed to the mutagenizing agent. The nucleotide sequence(s) to be mutagenized can
also be present in a host cell either by being integrated into the genome of said cell or by being 
present on a vector harboured in the cell. Alternatively, the nucleotide sequence to be mutagenized 
is in isolated form. The nucleotide sequence is preferably a DNA sequence such as a cDNA, 
genomic DNA or synthetic DNA sequence. 

Subsequent to the incubation with or exposure to the mutagenizing agent, the mutated nucleotide 
sequence, normally in amplified form, is expressed by culturing a suitable host cell carrying the 
nucleotide sequence under conditions allowing expression to take place. The host cell used for this 
purpose is one which has been transformed with the mutated nucleotide sequence(s), optionally
present on a vector, or one which carried the nucleotide sequence during the mutagenesis, or any 
kind of gene library. 

Constructing a peptide addition 

As a non-limiting example an N-terminal peptide addition containing N-glycosylation sites
can be designed on the basis of the following formula:
Y¹(NXT/S)Y²(NXT/S)_Z Y³-P,

wherein each of Y¹, Y² and Y³ independently is absent or 1, 2, 3 or 4 amino acid residues of any
type, X is a single amino acid residue of any type except proline, Z is any integer between 0 and 6,
T/S is a threonine or serine residue, preferably a threonine residue, N is an asparagine residue and
P is the lysosomal enzyme or activator to be modified.

In a first step about 10 different muteins are made that have the above formula. For instance,
the about 10 muteins are designed on the basis that each of Y¹, Y² and Y³ independently is 1 or 2
alanine residues or is absent, Z is any integer between 0 and 5, T/S is threonine, and X is alanine.
Based on, e.g., in vitro bioactivity and half-life results obtained with these muteins (or any other
relevant property), the optimal number(s) of amino acids and glycosylation site(s) can be determined
and new muteins can be constructed based on this information. The process is repeated until an
optimal glycosylated polypeptide is obtained.
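
The design space described above for the first round of muteins can be enumerated programmatically. The following is a minimal sketch, assuming one-letter amino acid codes and the restricted choices named in the text (alanine spacers, X alanine, T/S threonine, Z from 0 to 5). The function names peptide_addition, count_sequons and candidate_additions are illustrative, and the selection of about 10 muteins from the candidates is left to the experimenter.

```python
import re
from itertools import product

# N-X-T/S sequon with X other than proline; the lookahead lets
# back-to-back sequons all be counted.
SEQUON = re.compile(r"(?=N[^P][ST])")


def peptide_addition(y1, y2, y3, z, x="A", ts="T"):
    """Build the peptide addition Y1-(NXT/S)-Y2-(NXT/S)z-Y3 as a one-letter string."""
    if len(x) != 1 or x == "P":
        raise ValueError("X must be a single residue other than proline")
    if ts not in ("T", "S"):
        raise ValueError("T/S must be threonine (T) or serine (S)")
    if not 0 <= z <= 6:
        raise ValueError("Z must lie between 0 and 6")
    if any(len(y) > 4 for y in (y1, y2, y3)):
        raise ValueError("each Y may hold at most 4 residues")
    site = "N" + x + ts
    return y1 + site + y2 + site * z + y3


def count_sequons(peptide):
    """Count N-X-T/S sequons (X not proline) in a peptide string."""
    return len(SEQUON.findall(peptide))


def candidate_additions():
    """Unique additions with Y1, Y2, Y3 absent or 1 or 2 alanines and Z from 0 to 5."""
    spacers = ("", "A", "AA")
    return sorted({
        peptide_addition(y1, y2, y3, z)
        for y1, y2, y3 in product(spacers, repeat=3)
        for z in range(0, 6)
    })
```

Several parameter combinations yield the same sequence (for example when Z is 0 and the spacer is moved between Y² and Y³), which is why the candidates are collected in a set. The sequon count covers the addition alone and does not examine the junction with P.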

Alternatively, random mutagenesis may be used for creating N-terminally extended 
polypeptides. For instance, a random mutagenized library is made on the basis of the above formula. 
Doped oligonucleotides are synthesized that code for one amino acid residue in position X (the amino
acid residue being different from proline), with each of Y¹, Y² and Y³ independently being 0, 1 or 2
amino acid residues of any type, Z being 2 and T/S being threonine, and are used for constructing
the random mutagenized library.

As another non-limiting example an N-terminal peptide addition containing an in vitro
glycosylation site can be designed on the basis of the following formula (using a lysine residue as an
example of such site):
Y¹(K)Y²(K)_Z Y³-P,

wherein each of Y¹, Y² and Y³ independently is 0, 1, 2, 3 or 4 amino acid residues of any type
except lysine, Z is an integer between 0 and 6, K is lysine, and P is the lysosomal enzyme or activator.

In a first step about 10 different muteins are made that have the above formula. For instance,
the about 10 muteins are designed on the basis that each of Y¹, Y² and Y³ independently is 1 or 2
alanine residues or is absent, Z is any integer between 0 and 5, and X is alanine. The muteins are then
glycosylated with a suitable oligosaccharide moiety. Based on, e.g., in vitro bioactivity and half-life
results obtained with these muteins (or any other relevant property), the optimal number(s) of amino
acids and glycosylation sites can be determined and new muteins can be constructed based on this
information. The process is repeated until an optimal glycosylated polypeptide is obtained.

Alternatively, random mutagenesis may be performed by making a random mutagenized library
based on the above formula. Doped oligonucleotides are synthesized that code for one amino acid
residue in position X (except proline), with each of Y¹, Y² and Y³ independently being 0, 1 or 2
amino acid residues of any type and Z being 2, and are used for constructing the random
mutagenized library.

It will be understood that the above design schemes are intended for illustration purposes only and
that a person skilled in the art will be aware of alternative useful routes for the design of peptide
additions. Furthermore, it will be understood that peptide additions with other attachment groups can
be designed in an analogous way.

Furthermore, a nucleotide sequence encoding a polypeptide of the invention comprising an 
N- or C-terminal peptide addition can be prepared by a method comprising 

a) subjecting a nucleotide sequence encoding the parent polypeptide to elongation mutagenesis,

b) expressing the mutated nucleotide sequence obtained in step a) in a suitable host cell to obtain an
in vivo glycosylated polypeptide or subjecting the expressed polypeptide to in vitro glycosylation or 
conjugation to a non-oligosaccharide macromolecular moiety, as appropriate, 

c) selecting glycosylated and/or conjugated polypeptides comprising at least one oligosaccharide or 
non-oligosaccharide macromolecular moiety attached to the peptide addition part of the polypeptide, 
and 

d) isolating a nucleotide sequence encoding the polypeptide part of conjugates selected in step c). 

In the present context the term "elongation mutagenesis" is intended to indicate any manner 
in which the nucleotide sequence encoding the parent polypeptide can be extended to further encode 
a peptide addition as described herein above. For instance, a nucleotide sequence encoding a 
peptide addition of a suitable length may be synthesized and fused to a nucleotide sequence 
encoding the polypeptide. The resulting fused nucleotide sequence may then be subjected to further 
modification by any suitable method, e.g. one which involves gene shuffling, other recombination 
between nucleotide sequences, random mutagenesis, random elongation mutagenesis or any
combination of these methods (as described in the immediately preceding section). 

The expression and conjugation steps are conducted as described in further detail elsewhere
in the present application, and the selection step c) is conducted using any suitable method
available in the art.

In one embodiment the above method further comprises, prior to the selection step, screening
conjugates resulting from step b) for at least one improved property, in particular any of those
improved properties listed herein, and the selection step c) further comprises selecting conjugates
having such improved property.

Furthermore, in the above method the elongation mutagenesis can be conducted so as to 
enrich for codons encoding an amino acid residue comprising an attachment group for the 
oligosaccharide or non-oligosaccharide macromolecular moiety, in particular an in vivo 
glycosylation site. 

Usually, when a polypeptide conjugate has been selected in a screening step of a method of the 
invention the nucleotide sequence encoding the polypeptide part of the conjugate is isolated and 
used for expression of larger amounts of the polypeptide. The amino acid sequence of the resulting 
polypeptide is determined and the polypeptide is subjected to conjugation on a larger scale.
Subsequently, the polypeptide conjugate is assayed with respect to the property to be improved. 

Assays for biological activity 

Secondary screening can be performed to characterize the binding and uptake of the present 
polypeptides in macrophages. This is illustrated herein for GCB polypeptides, but a similar approach 
can be used for testing properties of other lysosomal enzymes.

It has been shown that GCB is taken up primarily by macrophages through the macrophage
mannose receptor. Though many macrophage cell lines do not express functional macrophage
mannose receptors, the murine macrophage cell line J774E has been found positive for this receptor
(Blum et al., 1991, Carbohydr. Res. 213, 145-153). The uptake can be measured either with
radioactively labelled GCB polypeptide or, as preferred, by enzyme activity assays on lysed cells
after uptake of the polypeptide (the combined uptake/activity assay is described in further detail in
the examples section herein).

As an alternative to the murine macrophage cell line J774E, peritoneal macrophages can be isolated
from 6-8 week old BALB/cByJ mice and used for studying the uptake of radioactively labelled GCB
polypeptides (or for the combined uptake/enzyme-activity assay).

In a further aspect the invention relates to an assay method for measuring the efficiency of 
cellular uptake of a GCB polypeptide into cultured macrophage cells, the method comprising 
culturing J774E murine macrophage cells in a medium containing the GCB polypeptide for a
sufficient period of time allowing for uptake of the GCB polypeptide, lysing said cells in the
presence of a buffer containing a substrate for the GCB polypeptide, and measuring the amount of
enzyme activity in the lysate.

The GCB to be assayed can be any GCB polypeptide, in particular a wtGCB or a functional 
fragment or variant thereof. In particular, the GCB polypeptide to be assayed may be a polypeptide 
of the present invention. In the method according to this aspect, a preferred substrate is
para-nitrophenyl-glucopyranoside or 4-methylumbelliferyl-glucopyranoside.

The pharmacokinetics and pharmacodynamics of the present polypeptides may be studied to select
for such polypeptides that exhibit a longer functional in vivo half-life in order to ensure infrequent
dosing and prevent the low plasma levels seen with the currently available GCBs. The
pharmacokinetics is studied by intravenous administration of the present polypeptides followed by
determination of plasma clearance and cell specific distribution in liver and spleen using the
GCB Activity Assay. Friedmann et al., 1999, Blood 93; 2807-2816, have published a protocol to
separate phagocytic Kupffer cells from other liver endothelial cells and thereafter study the cell
specific uptake of administered GCB. Also, a suitable method is disclosed in the Methods section
herein. Preferred polypeptides should have slower plasma or lysosomal clearance and/or
improved lysosomal uptake.

Therapeutic utility 

While the polypeptide of the invention may be useful in the treatment of various types of diseases
and disorders, it is presently contemplated to be of particular utility for substitution therapy in the
prevention or treatment of a lysosomal storage disease treatable by the lysosomal enzyme of the
polypeptide. When the polypeptide of the invention is a GCB, SapC or SapA polypeptide, the
disease to be treated is preferably Gaucher's disease, in particular Type I Gaucher's disease.
Thus, in a preferred aspect, the present invention relates to the use of a GCB, SapC or SapA
polypeptide of the invention for the manufacture of a medicament for the prevention or treatment of
Gaucher's disease. Furthermore, the invention relates to a method of treating Gaucher's disease by
administering, to a patient in need thereof, an effective amount of the GCB or SapC polypeptide, or
a pharmaceutical composition of the invention. Analogously, when the polypeptide of the invention
is alpha-galactosidase or SapB, it may be used in the treatment of Fabry's disease; when ceramidase
or SapD, in the treatment of Farber's disease; when beta-galactosidase, in the treatment of GM1
gangliosidosis; when beta-hexosaminidase or GM-2 activator, in the treatment of Tay-Sachs disease;
when sphingomyelinase, in the treatment of Niemann-Pick disease; when alpha-N-
acetylgalactosaminidase, in the treatment of Sly syndrome; when iduronidase, in the treatment of
Hurler/Scheie syndrome; when galactocerebrosidase, in the treatment of Batten disease; and when
alpha-glucosidase, in the treatment of Pompe's disease.

While the polypeptide of the invention is anticipated to exhibit therapeutic utility for the
same purpose, it is believed that, due to the improved lysosomal activity of the polypeptide, it may
be administered in dosages that are lower than with the current treatment. For GCB, the
recommended dosage by the manufacturer is 60 units/kg body weight/2 weeks. The GCB
polypeptide of the invention may therefore be administered at a dose approximately paralleling that
employed in therapy with human GCB such as Cerezyme™ or a lower dose and/or less frequently
than Cerezyme™. The exact dose to be administered depends on the circumstances. Normally, the
dose should be capable of preventing or lessening the severity or spread of the condition or
indication being treated. It will be apparent to those of skill in the art that an effective amount of a
polypeptide or composition of the invention depends, inter alia, upon the disease, the dose, the
administration schedule, whether the polypeptide or composition is administered alone or in
conjunction with other therapeutic agents, the serum half-life of the compositions, and the general
health of the patient.

The polypeptide of the invention is preferably administered in a composition including a 
pharmaceutically acceptable carrier or excipient. "Pharmaceutically acceptable" means a carrier or
excipient that does not cause any untoward effects in patients to whom it is administered. Such
pharmaceutically acceptable carriers and excipients are well known in the art. 

The polypeptide of the invention can be formulated into pharmaceutical compositions by
well-known methods. Suitable formulations are described in Remington's Pharmaceutical Sciences
by E. W. Martin and in US 5,183,746.

The pharmaceutical composition of the polypeptide of the invention may be formulated in a variety
of forms, including liquid, gel, lyophilized, or any other suitable form. The preferred form will
depend upon the particular indication being treated and will be apparent to one of skill in the art.

The pharmaceutical composition containing the polypeptide of the invention may be
administered orally, intravenously, intramuscularly, intraperitoneally, intradermally,
subcutaneously, by inhalation, or in any other acceptable manner, e.g. using PowderJect or ProLease
technology. The preferred mode of administration will depend upon the particular indication being
treated and will be apparent to one of skill in the art.

The pharmaceutical composition of the invention may be administered in conjunction with
other therapeutic agents. These agents may be incorporated as part of the same pharmaceutical
composition or may be administered separately from the polypeptide of the invention, either
concurrently or in accordance with any other acceptable treatment schedule. For instance, when the
polypeptide is a lysosomal enzyme such agent may be an activator thereof. When the lysosomal
enzyme is GCB, SapC and/or SapA is one example of such agent. When the lysosomal enzyme is
arylsulphatase A, SapB is an example of such agent. When the lysosomal enzyme is
alpha-galactosidase, SapB and/or SapD is an example of such agent. When the lysosomal enzyme is
hexosaminidase, GM-2 activator is an example of such agent.

Also contemplated is the use of a nucleotide sequence encoding a polypeptide of the
invention in gene therapy applications. In particular, it may be of interest to use a nucleotide
sequence encoding a polypeptide having at least one introduced in vivo glycosylation site. The
glycosylation of the polypeptides is thus achieved during the course of the gene therapy, i.e. after
expression of the nucleotide sequence in the human body.

Both in vitro and in vivo gene therapy methodologies are contemplated. Several methods for
transferring potentially therapeutic genes to defined cell populations are known. For further
reference see, e.g., Mulligan, "The Basic Science Of Gene Therapy", Science, 260, pp. 926-31
(1993). These methods include:

Direct gene transfer, e.g., as disclosed by Wolff et al., "Direct Gene Transfer Into Mouse Muscle In
Vivo", Science, 247, pp. 1465-68 (1990);

Liposome-mediated DNA transfer, e.g., as disclosed by Caplen et al., "Liposome-mediated CFTR
Gene Transfer to the Nasal Epithelium Of Patients With Cystic Fibrosis", Nature Med., 1, pp. 39-46
(1995); Crystal, "The Gene As A Drug", Nature Med., 1, pp. 15-17 (1995); Gao and Huang, "A
Novel Cationic Liposome Reagent For Efficient Transfection of Mammalian Cells",
Biochem. Biophys. Res. Comm., 179, pp. 280-85 (1991);

Retrovirus-mediated DNA transfer, e.g., as disclosed by Kay et al., "In vivo Gene Therapy of
Hemophilia B: Sustained Partial Correction In Factor IX-Deficient Dogs", Science, 262, pp. 117-19
(1993); Anderson, "Human Gene Therapy", Science, 256, pp. 808-13 (1992);

DNA virus-mediated DNA transfer. Such DNA viruses include adenoviruses (preferably Ad-2 or
Ad-5 based vectors), herpes viruses (preferably herpes simplex virus based vectors), and
parvoviruses (preferably "defective" or non-autonomous parvovirus based vectors, more preferably
adeno-associated virus based vectors, most preferably AAV-2 based vectors). See, e.g., Ali et al.,
"The Use Of DNA Viruses as Vectors for Gene Therapy", Gene Therapy, 1, pp. 367-84 (1994); US
4,797,368, and US 5,139,941.

The invention is further described in the following examples. The examples should not, in
any manner, be understood as limiting the generality of the present specification and claims.

MATERIALS

GCB Activity Assay Buffer. 

120 mM phosphate/citrate buffer, pH=5.5, 1 mM EDTA, pH=8.0, 0.25 % Triton X-100, 0.25 %
taurocholate, 4 mM β-mercaptoethanol

pGC-12 vector

pVL1392 (Pharmingen, USA) with GCB wt cDNA sequence (SEQ ID NO 2) inserted between
EcoRV and XbaI.

Table 1

Sequence of primers used for cloning the wt GCB coding region and inserting signal peptides into
the pGCBmat plasmid as described in Example 1.

SO49 (WT-sp-BglII): 5'-CGCAGATCTGATGGCTGGCAGCCTCACAGGATTGC-3'
SO50 (WT-stop-EcoRI): 5'-CCGGAATTCCCATCACTGGCGACGCCACAGGTAGGTG-3'
SO51 (WT-mature-SacI): 5'-ACGCGAGCTCGCCCCTGCATCCCTAAAAGCTTCGG-3'
SO52 (SPegt-NheI/SacI-as): 5'-GCGTTGACGGCAGTCAGAGTTGACAGAAGGGCCAGCCAGCAAAGGATAGTCATG-3'
SO53 (SPegt-NheI/SacI-s): 5'-
CTAGCATGACTATCCTTTGCTGGCTGGCCCTTCrTGTCAACT
CT-3'
SO54 (SPegt-NheI/SacI-as): 5'-CCTGCrACTGCTCCCAGCAGCAGTGAAAGAGTCCAAAGTGGCAGCATG-3'
SO55 (SPegt-NheI/SacI-s): 5'-
CTAGCATGCTGCCACTTTGGACTCTTTCACT

Table 2

Primers used for introduction of N-glycosylation sites randomly as described in Example 2.
Written with the nucleotide sequence from 5' to 3'.

SO60: CAGCTGGCCATGGGTACCCGG
SO90: CCCTCCAAATCCCTTCACTTTCTGG
SO116: GAGTTTTTGGTTCTTGCCGGGTCC
SO128: CCTTCACTGTCTGGTTCTTCTGTTCTGGC
SO130: CCGTCACGTTCTGGAACTTCTGTTCTGGC
SO131: CCAAACCAGACCTTCCAGAAAGTGAAGGG
SO132: CCTTCGTTTTGTTGAACTTCTGTTCTGGC
SO133: CCAGAAAACAAGACCCAGAAAGTGAAGGG
SO134: CCGGTTCCGTTTTCAGAGAAGTACGATTTAAG
SO135: CCAGAACAGAAGTTCCAGAAAGTGAAGGG
SO136: ATTCCAGTTTCATTGAAGTACGATTTAAG
SO137: GGTACCTTCAGCCGCTATGAGAGTACACG
SO138: ATTCCTTCGGTAGAGTTGTACGATTTAAG
SO139: GGTAACTTCAGCCGCTATGAGAGTACACG
SO140: ATTCCTTCTTCAGAGAAGTTCGATTTAAG
SO141: GGTACCAACAGCACCTATGAGAGTACACG
SO142: GGTGTCTTGTTCTTGGTATCTTCCTCTGG
SO143: GGTACCTTCAACCGCACCGAGAGTACACG
SO144: GGTATCTTGGTCTTGTTATCTTCCTCTGG
SO145: GGTACCTTCAGCAACTATACTAGTACACG
SO146: GGTATCTTGAGCGTGGTATTTTCCTCTGG
SO147: GGTACCTTCAGCCGCAATGAGAGTACACG
SO148: GGTATCTTGAGCTTGGTATCTTCCTCTGG
SO149: CCAGAGAACGATACCAAGCTCAAGATACC
SO150: CTGGK3TGTAGTTGTCCXXGGGCTGTCCCTTGAGTGACC
SO151: CCAAACGAAACTACCAAGCTCAAGATACC
SO152: GTGGGTGATGTTCCCGGGCTGTCCCTTGAGTGACC
SO153: CCAGAGGAAGATACCAAGCTCAAGATACC
SO154: GTGGTAGATGTCCCCGGGCTGTCCCTTGAGTGACC
SO155: GGTCAAACAAGACACAGCCCGGGGACATCTACCAC
SO156: CTGTCAGCACCGTCTTGTTCCAGTGGGGC
SO157: GGTCACTCAAGGGACAGCCCGGGGACATCTACCAC
SO158: CTGTGGTCACGTTCTTTGCCCAGTGGGGC
SO159: GCCCAACTGGACTAAGGTGGTGCTGACAG
SO160: CTGTCAGGTTCACCTTTGCCCAGTGGGGC
SO161: GCCCCACACCGCAACCGTGGTGCTGACAG
SO162: CTGTCAGCACCACCTTTGCCCAGTGGGGC
SO163: GCCCCACTGGGCAAAGGTGGTGCTGACAG

Cerezyme was kindly provided by Dr. E. Beutler, Scripps Institute, CA, USA 

J774E was kindly provided by G. Grabowski, Cincinnati, Ohio, US 

METHODS 

GCB Activity Assay using PNP-glucopyranoside or 4-MU-glucopyranoside substrate 

The enzymatic activity of recombinant GCB is measured using p-nitrophenyl-β-D-glucopyranoside
(PNP-Glu) or the fluorescent compound 4-methylumbelliferyl-β-D-glucopyranoside (4-MUGlu) as a
substrate. Hydrolysis of the PNP-Glu substrate generates p-nitrophenol, which can be quantified by
measuring absorption at 405 nm using a spectrophotometer, as previously described (Friedmann et
al., 1999, Blood 93; 2807-2816). Hydrolysis of the 4-MUGlu substrate generates
4-methylumbelliferone, which can be quantified by measuring fluorescence at 460 nm (excitation at
360 nm) using a PolarStar Galaxy spectrofluorometer. The assay is carried out under conditions
which partially inhibit non-GCB glucosidase activities, such conditions being achieved by using a
phosphate/citrate buffer pH=5.5, 0.25 % Triton X-100 and 0.25 % taurocholate.

The assay is run in a final volume of 200 µl, containing GCB Activity Assay Buffer and 4
mM PNP-Glu or 3 mM 4-MUGlu. The enzymatic hydrolysis is initiated by adding GCB and the
reaction is allowed to proceed for 1 hour at 37°C before being stopped by adding 50 µl 1 M NaOH,
after which absorption at 405 nm is measured. A reference standard curve of p-nitrophenol or
4-methylumbelliferone, assayed in parallel, is used to quantify concentrations of GCB in samples to be
tested.
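
The use of the reference standard curve can be sketched as follows. This is a minimal illustration, assuming a linear relation between the amount of product standard and the reading within the working range. The function names fit_line and product_amount are illustrative, and any standard amounts supplied to them are the user's own; no values from the present assay are implied.

```python
def fit_line(xs, ys):
    """Ordinary least-squares line through (x, y) points; returns (slope, intercept)."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two paired points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def product_amount(reading, slope, intercept):
    """Convert a sample reading (e.g. OD405) to product amount via the standard curve."""
    return (reading - intercept) / slope


def activity_per_hour(reading, slope, intercept, hours=1.0):
    """Product formed per hour of incubation; the default is the 1 hour used in the assay."""
    return product_amount(reading, slope, intercept) / hours
```

The standards are fitted once per plate with fit_line, and each sample reading is then converted with product_amount. Readings outside the range spanned by the standards should not be extrapolated.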

In vitro uptake and stability of GCB polypeptide in macrophages

The murine monocyte/macrophage cell line J774E (Mukhopadhyay and Stahl, Arch
Biochem Biophys 1995 Dec 1;324(1):78-84 and Diment et al., J Leukoc Biol 1987 Nov;42(5):485-90)
is used to study the uptake and stability of GCB polypeptides. Cells are grown in alpha-MEM
(supplemented with 10 % fetal calf serum, 1X Pen/Strep, and 60 µM 6-thioguanine), seeded
(200,000 cells per well) in the above-mentioned medium containing 10 µM conduritol B epoxide, CBE
(an irreversible GCB inhibitor) and incubated for 24 hr at 37°C.

Before starting the uptake assay, cells are washed in 0.5 ml HBSS (Hanks balanced salt
solution). The uptake is done in a 200 µl volume containing the appropriate concentration of GCB
polypeptide (a dose-response curve is made with GCB concentrations in the range of 25-400
mU/ml). As a control, yeast mannan (final concentration 1.4 mg/ml) is added to inhibit the uptake
through the macrophage mannose receptor. The cells are incubated for 1 hr at 37°C and washed
three times with 0.5 ml cold HBSS.

To measure the amount of GCB taken up by the J774E cells, cells are lysed in 200 µl GCB
Activity Assay Buffer with 4 mM PNP-Glu and incubated for 1 hr at 37°C. Then, the hydrolysis is
stopped by addition of 50 µl 1 M NaOH and OD405 is measured. The data are analysed by
non-linear regression using GraphPad Prism 2.0 (GraphPad Software, San Diego, CA).
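
The analysis of the dose-response data can be sketched as follows. The regression model is not specified herein, so this sketch rests on two assumptions: that receptor-mediated uptake is the difference between uptake without and with mannan, and that it follows a hyperbolic saturation curve, uptake = Umax x dose / (K + dose). A Hanes-Woolf linearisation is used here in place of non-linear regression to keep the example self-contained. All function names are illustrative.

```python
def fit_line(xs, ys):
    """Ordinary least-squares line through (x, y) points; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def specific_uptake(total, with_mannan):
    """Mannan-inhibitable uptake at each dose: total minus uptake in the presence of mannan."""
    return [t - m for t, m in zip(total, with_mannan)]


def fit_saturation(doses, uptake):
    """Estimate (Umax, K) of a hyperbolic saturation curve.

    Hanes-Woolf form: dose / uptake = dose / Umax + K / Umax,
    so the slope is 1 / Umax and the intercept is K / Umax.
    """
    if any(u <= 0 for u in uptake):
        raise ValueError("uptake values must be positive")
    ratios = [d / u for d, u in zip(doses, uptake)]
    slope, intercept = fit_line(doses, ratios)
    umax = 1.0 / slope
    return umax, intercept * umax
```

The linearised fit weights the data differently from a true non-linear fit, so the estimates are best treated as starting values for a regression program.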

To study the stability of GCB polypeptides in J774E cells, CBE treated cells are incubated
with 400 mU/ml GCB for 1 hr at 37°C. Then, cells are washed 3 times in HBSS to remove
extracellular GCB and incubated in HBSS. A time-course study is done by lysing the cells after 30
min, 1 hr, 2 hr, 3 hr, 4 hr, and 5 hr in 200 µl GCB Activity Assay Buffer with 4 mM PNP-Glu and
incubating the samples for 1 hr at 37°C before stopping the hydrolysis with 50 µl 1 M NaOH and
measuring OD405. The data are analysed by non-linear regression using GraphPad Prism 2.0
(GraphPad Software, San Diego, CA).

SapC activation of GCB polypeptides

Phosphatidylserine from bovine brain is prepared for assay by dissolving it in 1:1 (vol/vol)
methanol:chloroform, drying it down in aliquots and storing these at -20°C. On the day of the assay,
an aliquot is dissolved and diluted in buffer (120 mM phosphate buffer pH=4.7, 1 mM EDTA, 2 mM
β-mercaptoethanol) and sonicated for 10 min. GCB polypeptide activation by SapC is done in a total
volume of 200 µl containing 1.25 mU/ml GCB polypeptide, 120 mM phosphate buffer pH=4.7, 1
mM EDTA, 2 mM β-mercaptoethanol, 5 ng/ml phosphatidylserine, 4 mM PNP-Glu and SapC
(produced as described in Example 4). The assay is done by pre-incubating GCB polypeptide, lipid
and SapC for 20 min at room temperature before starting the assay by addition of the substrate. The
reaction mixture is incubated for 1 hr at 37°C before the hydrolysis is stopped by addition of 50 µl 1
M NaOH and OD405 is measured. The data are analysed by non-linear regression using GraphPad
Prism 2.0.

Assays for determination of increased in vivo activity/functional in vivo half-life
Increased in vivo activity/functional in vivo half-life is measured using the uptake assays described
below. The intracellular activity is measured at different time points after incubation with the GCB
polypeptide and the time at which half of the initial activity remains is calculated using standard
software programs, e.g. GraphPad Prism 2.0.
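
The half-life calculation can be sketched as follows. This minimal illustration assumes first-order (single exponential) decay of the intracellular activity, A(t) = A0 x exp(-k x t), so that the half-life equals ln 2 / k. It uses a log-linear least-squares fit in place of the non-linear regression performed by a program such as the one named above, and the function names are illustrative.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares line through (x, y) points; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def half_life(times, activities):
    """Half-life (in the units of `times`) from a first-order decay time course."""
    if any(a <= 0 for a in activities):
        raise ValueError("activities must be positive to take logarithms")
    slope, _ = fit_line(times, [math.log(a) for a in activities])
    if slope >= 0:
        raise ValueError("activity does not decay over the time course")
    return math.log(2) / -slope
```

With time points such as 0.5, 1, 2, 3, 4 and 5 hr the result is returned in hours. If the decay is clearly not mono-exponential, the first-order assumption does not hold and a different model is required.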

Alternatively, activity in different liver cells after infusion of GCB polypeptide into live 
animals is determined (Friedmann et al., Blood 93, 2807-2816, 1999). Briefly, the GCB polypeptide is
infused intravenously into animals. The animals are sacrificed at different time points after the 
infusion and different liver cell fractions isolated using a combination of Percoll (Sigma) 
centrifugation and magnet-based isolation of cells with phagocytic capacity. The amount of GCB 
activity retained in the cells after different time points is determined using the GCB Activity Assay 
as described above. Furthermore, lysosomes can be isolated from these cells using further Percoll
centrifugations and preferably magnetic chromatography in order to measure the lysosomal activity
of the GCB polypeptide (Diettrich et al., FEBS Lett. 1998; 441: 369-72).

As an example, in vivo uptake of a GCB polypeptide is determined by giving 6-8 week-old
Balb/c mice a single bolus injection into the tail vein using 40 units GCB polypeptide per gram
body weight. As a control, mannosylated BSA is used to determine the endogenous level of GCB.

Measurement of serum half-life. For pharmacokinetics studies, tail vein bleeds (~10
µl/bleed) are done every 10 seconds (up to 5-10 minutes after administration) and sera from these
bleeds are assayed for GCB activity using the GCB Activity Assay. Serum concentration-time data
are described by first order exponential equations and the serum half-life is calculated from these.

Organ distribution of GCB polypeptide. To determine the organ distribution, animals are killed 20 
minutes post-injection. The liver, spleen, heart, lung, brain and kidneys are excised and tissue 
homogenates are prepared and assayed for GCB activity. The bio-distribution is given as GCB 
activity recovered per gram wet weight tissue. 

Hepatocellular distribution: Mice are administered a single bolus tail vein injection of GCB 
polypeptide or mannosylated BSA (controls). At 20 min, 1 h, 3 h, 8 h, 16 h, 24 h, 36 h, 72 h, and 144 h 
post-injection, livers of anesthetized mice are perfused in situ with PBS and collagenase D 
and the different liver cell populations (parenchymal, Kupffer and endothelial cells) are separated as 
previously described (Friedman et al., 1999, Blood 93; 2807-2816) or by magnetic cell separation 
(MACS) using CD11b microbeads (Miltenyi Biotec Inc.). These separated cell populations are then 
assayed for GCB activity using the GCB Activity Assay, and the data are given as: 1) GCB activity 
per gram liver and 2) GCB activity per 10^6 cells per gram liver. 

Isolation of Kupffer cells 

Mice were euthanized and livers perfused in situ via the portal vein with 0.5 U/ml 
collagenase solution (Collagenase D No. 108882, Roche Diagnostics) for 4-5 minutes. The liver was 
then removed and submerged in 3 ml collagenase solution, where it was gently minced, and the 
collagenase was allowed to digest the liver tissue for 1 hour at 37°C on a rocking table. 

After 1 hour of digestion the liver solution was gently homogenized using a 5 ml 
serological pipette and PBS was added to a total of 10 ml. In order to remove undigested tissue and 
obtain a single-cell suspension, the solution was filtered through gauze and then through a 60 µm nylon 
mesh. 

This single-cell liver suspension was centrifuged at 1800 rpm, 10 min, 18°C, the supernatant 
removed and the pellet resuspended in PBS, 0.5% bovine serum albumin (BSA), 2 mM EDTA. For 
further purification the cell suspension was centrifuged through a 20% ice-cold Percoll solution 
(1.031 g/ml) at 1600 rpm, 5 min, 20°C in a swing-bucket centrifuge without brakes. The resulting 
upper layer and interface, containing dead cells and debris, were removed. The purified liver cell 
fraction, consisting of hepatocytes, Kupffer cells and endothelial cells, was at the bottom of the 
tube. This fraction was washed twice with PBS, 0.5% BSA, 2 mM EDTA and centrifuged at 
1600 rpm, 5 min, 20°C. 

The Kupffer cell fraction was isolated according to the manufacturer's instructions, using anti-MHC 
Class II-conjugated magnetic beads over an LS+ MidiMACS separation column (Miltenyi Inc.). 
Briefly, after the last centrifugation the cell fraction was resuspended in 0.45 ml PBS, 0.5% BSA, 
2 mM EDTA, followed by addition of anti-MHC Class II-conjugated magnetic beads, and the anti-MHC 
Class II-positive cells were eluted with PBS, 0.5% BSA, 2 mM EDTA. The eluted cell 
fraction, consisting of Kupffer cells, was finally concentrated by centrifugation at 1600 rpm, 5 min, 
20°C and resuspended in a small volume of PBS, 0.5% BSA, 2 mM EDTA. 

Approximately 1.2 x 10^6 Kupffer cells were obtained from one liver. GCB activity was determined by 
use of the PNP GCB Activity Assay. 

Proteolytic stability 

The proteolytic stability of a GCB polypeptide is measured by incubating the polypeptide (e.g. a 
mutein) and the reference (e.g. wt GCB) with extracts of rat liver lysosomes at pH 4.5 to 5.0. The 
incubation is run for 1 to 24 hours with samples taken every 10 to 60 minutes, and the remaining 
enzymatic activity is determined using the PNP assay. The proteolytic half-life of wt and mutein is 
then determined. A method for the preparation of the lysosomal extracts for digestion of proteins is 
given by Coffey and de Duve, J. Biol. Chem. 243, pp. 3255-3263, 1968. 

Site-directed mutagenesis 
Site-directed mutations were constructed using PCR with oligonucleotides 
containing the desired amino acid exchanges or additions (e.g. to introduce glycosylation sites). The 
resulting PCR fragment was cloned into the GCB expression vector using appropriate restriction 
enzymes and subsequently DNA sequenced in order to confirm that the construct contained the 
desired exchanges. 

EXAMPLES 

EXAMPLE 1 

PRODUCTION OF WT GCB 

Cloning and Expression in Insect Cells 

A human fibroblast cDNA library was obtained from Clontech (Human Fibroblast skin cDNA 
cloned in lambda-gt11, cat# HL1052b). Lambda DNA was prepared from the library by standard 
methods and used as a template in a PCR reaction with either SO49 and SO50 as primers (amplifies 
the GCB coding region with the human signal peptide from the second ATG) or SO50 and SO51 as 
primers (amplifies the mature part of the GCB coding region) (see Table 1 in the Materials section). 

The PCR products were reamplified with the same primers and agarose gel purified. 
Subsequently the SO49/50 PCR product was digested with BglII and EcoRI and cloned into the 
pBlueBac 4.5 vector (Invitrogen, Carlsbad, CA, USA) digested with BamHI 
and EcoRI. Sequencing confirmed that the insert is identical to the wtGCB sequence as given in 
SEQ ID NO 2. The resulting plasmid was used for infection of insect cells, with the GCB being 
partly secreted from the cells due to the human signal sequence as described in Martin et al., DNA 7, 
pp. 99-106, 1988. The SO50/51 PCR product was digested with SacI and EcoRI and cloned into the 
pBlueBac 4.5 vector (Invitrogen, Carlsbad, CA, USA) digested with the same enzymes, resulting in 
the pGCBmat plasmid. Two different signal sequences were inserted upstream of the mature GCB 
codons in order to increase the secreted amount of enzyme. The baculovirus ecdysteroid 
UDP-glucosyltransferase (egt) signal sequence (Murphy et al., Protein Expression and Purification 4, 
349-357, 1993) was inserted by annealing SO52 and SO53 (Table 1) and the human pancreatic 
lipase signal sequence (Lowe et al., J. Biol. Chem. 264, 20042, 1989) was inserted by annealing 
SO54 and SO55 (Table 1) and cloning them into the NheI and SacI digested pGCBmat plasmid. 
Infection of Spodoptera frugiperda (Sf9) cells with the resulting plasmids was done according to the 
protocols from Invitrogen, Carlsbad, CA, USA. 

Purification of GCB polypeptides produced in insect cells 

Polypeptides with GCB activity were purified as described in US 5,236,838, with some 
modifications. Cells were removed from the culture medium by centrifugation (10 min at 4000 rpm 
in a Sorvall RC5C centrifuge) and the supernatant was microfiltered using a 0.22 µm filter prior to 
purification. DTT was added to 1 mM and the culture supernatant was ultrafiltered to 
approximately 1/10 of the starting volume using a Vivaflow 200 system (Vivascience). The 
concentrated medium was centrifuged to remove possible aggregates before application on a 
Toyopearl Butyl 650C resin (TosoHaas) previously equilibrated in 50 mM sodium citrate, 20% (v/v) 
ethylene glycol, 1 mM DTT, pH 5.0. This chromatographic step was performed at room 
temperature. The resin was washed with at least 3 column volumes of 50 mM sodium citrate, 20% 
(v/v) ethylene glycol, 1 mM DTT, pH 5.0 (until the absorbance at 280 nm reached baseline level) 
and GCB was eluted with a linear gradient from 0% to 100% 50 mM sodium citrate, 80% (v/v) 
ethylene glycol, 1 mM DTT, pH 5.0. Fractions were collected and assayed for GCB activity using 
the Activity Assay (PNP-Glu). Usually, wt GCB starts to elute at approx. 70% (v/v) ethylene glycol. 
The subsequent purification was done by either of the following two methods. Method #2 
results in GCB of a higher purity. 

Method #1 

GCB-enriched fractions from the first process step were pooled and diluted approx. 4 times with a 
buffer containing 50 mM sodium citrate, 5 mM DTT, pH 5.0 to reduce the ethylene glycol content 
to 20% (or lower). In the second HIC purification step the diluted and partially purified GCB was 
applied on a Toyopearl phenyl resin (TosoHaas) equilibrated in 50 mM sodium citrate, 1 mM DTT, 
pH 5.0 (Buffer A) before use. After application, the resin was washed with at least 3 column 
volumes of 50 mM sodium citrate, pH 5 (until the absorbance at 280 nm reached baseline level) and 
GCB was then eluted with a linear ethanol gradient from 0% to 100% buffer B (50 mM sodium 
citrate, 50% (v/v) ethanol, 1 mM DTT, pH 5.0). Highly purified fractions of GCB (wildtype ≥ 95% 
pure), identified using the GCB Activity Assay, start to elute at approx. 40% ethanol. The purified 
GCB bulk product was dialyzed against 50 mM sodium citrate, 0.2 M mannitol, 0.09% Tween 80, pH 
6.1 to retain the GCB activity upon subsequent storage at 4-8°C or at -80°C. 
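The 4-fold dilution in Method #1 can be checked with a simple mixing calculation. The sketch below assumes that "diluted approx. 4 times" means one volume of pooled fractions plus three volumes of diluent, that volumes are additive, and that the pooled fractions contain between 70% and 80% (v/v) ethylene glycol and 1 mM DTT; the function name is illustrative only.

```python
def mix(conc_1, vol_1, conc_2, vol_2):
    """Concentration after combining two solutions (same units for both inputs)."""
    return (conc_1 * vol_1 + conc_2 * vol_2) / (vol_1 + vol_2)


# One volume of pooled fractions plus three volumes of diluent (assumed 4-fold dilution).
eg_upper = mix(80.0, 1, 0.0, 3)  # % (v/v) ethylene glycol, pool at the gradient end point
eg_lower = mix(70.0, 1, 0.0, 3)  # % (v/v) ethylene glycol, pool at the start of elution
dtt = mix(1.0, 1, 5.0, 3)        # mM DTT after adding diluent containing 5 mM DTT

print(eg_upper, eg_lower, dtt)
```

Under these assumptions even a pool at 80% ethylene glycol ends at 20%, which agrees with the stated target of 20% or lower.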

Method #2 

GCB-enriched fractions eluted from the Toyopearl Butyl 650C resin were pooled and applied at 4°C 
on an SP Sepharose resin (Amersham Pharmacia Biotech) previously equilibrated in 25 mM sodium 
citrate, 1 mM DTT, 10% ethylene glycol, pH 5.0. After application, the resin was washed with 25 
mM sodium citrate, 1 mM DTT, 10% ethylene glycol, pH 5.0 (until absorption at 280 nm reached 
baseline level) and GCB was then eluted with a linear gradient from 0 to 100% 0.25 M sodium 
citrate, 1 mM DTT, 10% ethylene glycol, pH 5.0. GCB begins to elute around 0.15 M sodium 
citrate. Fractions containing GCB were pooled and applied at room temperature onto a Phenyl 
Sepharose High Performance resin (Pharmacia Biotech) previously equilibrated in 25 mM sodium citrate, 
1 mM DTT, pH 5.0. After application, the resin was washed with 25 mM sodium citrate, 1 mM DTT, 
pH 5.0 until absorption at 280 nm reached baseline level, and GCB was then eluted with a linear 
ethanol gradient from 0 to 100% 25 mM sodium citrate, 1 mM DTT, 50% ethanol, pH 5.0. GCB 
typically elutes around 35% ethanol. 

The purified GCB bulk product was dialyzed against either 50 mM sodium citrate, 1 mM DTT, pH 
5.0 or 50 mM sodium citrate, 0.2 M mannitol, 1 mM DTT, pH 6.1 to retain the GCB activity upon 
subsequent storage. The purified GCB was concentrated and sterile-filtered before storage at 4-8°C 
or at -80°C. Typically, GCB purified by this method is >95% pure. 
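For a linear gradient, the elution points quoted above can be converted into the percentage of the end buffer at which the polypeptide leaves the column. The following sketch assumes an ideal linear gradient between the equilibration buffer and the end buffer with no delay volume; the function name is hypothetical.

```python
def percent_b(target, start, end):
    """Percent of end buffer (B) at which a linear gradient reaches `target`."""
    if start == end:
        raise ValueError("gradient start and end must differ")
    fraction = (target - start) / (end - start)
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("target lies outside the gradient")
    return 100.0 * fraction


# Elution points as quoted in the text; buffer compositions as described above.
butyl = percent_b(70.0, 20.0, 80.0)  # % (v/v) ethylene glycol, Butyl 650C step
sp = percent_b(0.15, 0.025, 0.25)    # M sodium citrate, SP Sepharose step
phenyl = percent_b(35.0, 0.0, 50.0)  # % ethanol, Phenyl Sepharose step

print(f"butyl {butyl:.1f}% B, SP {sp:.1f}% B, phenyl {phenyl:.1f}% B")
```

The calculation only restates the gradient geometry; actual elution positions depend on column dimensions, flow rate and system delay volume.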



EXAMPLE 2 

RANDOM INTRODUCTION OF GLYCOSYLATION SITES IN wtGCB 

In order to introduce glycosylation sites randomly in specified regions of the GCB cDNA, a primer 
was made for each glycosylation site to be introduced into the region. A series of PCRs was 
performed with mixtures of primers, as follows: 

Equimolar amounts of the following primer mixtures were used in the PCR: 

Random1: SO90(wt) +128 +130 +132 
Random2: SO131 +133 +135(wt) 
Random3: SO142 +144 +146 +148(wt) 
Random4: SO149 +151 +153(wt) 
Random5: SO150 +152 +154(wt) (SmaI) 
Random6: SO155 +157(wt) (SmaI) 
Random7: SO156 +158 +160 +162(wt) 
Random8: SO159 +161 +163(wt) 
RandomA: SO60(wt) +134 +136 +138 +140 
RandomB: SO137(wt) +139 +141 +143 +145 +147 

The primers are listed in Table 2 in the Materials section. 

Approximately 100 ng of the wtGCB cDNA is added as template and the PCR is performed under 
standard conditions. The length of the resulting product is indicated in parentheses following the 
primers. Figure 5 schematically illustrates the relative locations of the primers and PCR products spanning 
the GCB cDNA. 

PCR1A: Random1 + PBR10 (390 bp) 
PCR1B: Random2 + Random3 (240 bp) 
PCR1C: RandomA + RandomB (240 bp) 
PCR1D: Random4 + Random5 (165 bp) 
PCR1E: Random6 + Random7 (310 bp) 
PCR1F: Random8 + SO116 (620 bp) 
PCR2: SO116 + PBR10 (1650 bp) 

Products from reactions PCR1A-F were purified from an agarose gel using the Qiagen 
agarose gel purification kit, and approximately equimolar amounts were used in a second round of PCR 
using primers SO116 and PBR10 to reassemble the entire GCB cDNA in a 1650 bp product with a 
variable number of introduced glycosylation sites. The product from the second PCR was digested 
with NheI and EcoRI to yield a 1560 bp fragment and directionally cloned into the NheI/EcoRI sites 
of the pGC-12 vector. The ligation was transformed into competent E. coli cells and 1/10 of the 
transformation was plated onto LB agar containing ampicillin. The remaining 9/10 was grown in 
LB-Amp overnight and the genomic DNA of the resulting bacteria was isolated and used to produce a 
plasmid library containing variant GCB cDNAs with different numbers and locations of 
glycosylation sites. 

Plasmid minipreps were then selected at random and sequenced to determine the mutation 
frequency. If the sequencing revealed a suboptimal level of diversity, the process could be repeated. 
When a desirable level of diversity was obtained, the plasmid library was transfected into insect 
cells (Spodoptera frugiperda Sf9 cells) as described in, e.g., protocols published by Invitrogen, 
Carlsbad, CA. The resulting transfectants are screened for enzymatic activity using the GCB 
Activity Assay (PNP). Individual clones are then evaluated, e.g., for enzyme activity and/or cell 
uptake. 

EXAMPLE 3 

Preparation of GCB with N-terminal peptide additions using a site-directed mutagenesis approach 
Nucleotide sequences encoding the following N-terminal peptide additions were added to the 
nucleotide sequence shown in SEQ ID NO 2 encoding wtGCB: (A-4)+(N-3)+(I-2)+(T-1) 
(representing an extension to the N-terminus of the amino acid sequence shown in SEQ ID NO 1 
with the amino acid residues ANIT), and (A-7)+(S-6)+(P-5)+(I-4)+(N-3)+(A-2)+(T-1) (ASPINAT). 
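Both peptide additions carry the N-glycosylation consensus N-X-S/T, where X is any residue except proline. The short sketch below locates such sequons in a peptide; the function name is illustrative, and the check covers only the sequence motif, not whether a site is actually glycosylated in a given host cell.

```python
import re

# N, then any residue except P, then S or T; lookahead allows overlapping matches.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(peptide):
    """Return 1-based positions of the Asn in every N-X-S/T sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(peptide)]


for extension in ("ANIT", "ASPINAT"):
    print(extension, find_sequons(extension))
```

Each of the two extensions contains exactly one sequon, at position 2 of ANIT and position 5 of ASPINAT.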

A nucleotide sequence encoding the N-terminal peptide addition (A-4)+(N-3)+(I-2)+(T-1) 
was prepared by PCR using the following conditions: 

PCR 1: 
Template: 10 ng pBlueBacS with wt GCB cDNA sequence 
primer SO60: 5'-CAGCTGGCCATGGGTACCCGG-3' and 
primer SO85: 5'-TGGGCATCAGGTGCCAACATTACAGCCCGCCCCTGCATCCCTAAAAGC-3' 
BIO-X-ACT™ DNA polymerase (Bioline, London, U.K.) 
1x OptiBuffer™ (Bioline, London, U.K.) 
30 cycles of 96°C 30s, 55°C 30s, 72°C 1 min 

PCR 2: 
Template: 10 ng pBlueBacS with wt GCB 
Baculovirus forward primer: 5'-TTTACTGTTTTCGTAACAGTTTTG-3' and 
primer SO86: 5'-GCAGGGGCGGGCTGTAATGTTGGCACCTGATGCCCACGACACTGCCTG-3' 
BIO-X-ACT™ DNA polymerase (Bioline, London, U.K.) 
1x OptiBuffer™ (Bioline, London, U.K.) 
30 cycles of 96°C 30s, 55°C 30s, 72°C 1 min 

PCR 3: 
3 µl of agarose gel purified PCR 1 and PCR 2 products (approx. 10 ng) 
Baculovirus forward primer: 5'-TTTACTGTTTTCGTAACAGTTTTG-3' 
primer SO60: 5'-CAGCTGGCCATGGGTACCCGG-3' 
BIO-X-ACT™ DNA polymerase (Bioline, London, U.K.) 
1x OptiBuffer™ (Bioline, London, U.K.) 
30 cycles of 96°C 30s, 55°C 30s, 72°C 1 min 

The PCR 3 product was agarose gel purified, digested with NheI and NcoI, and cloned into 
pBlueBac4.5+wtGCB digested with NheI and NcoI. 
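The three reactions follow an overlap-extension scheme: the products of PCR 1 and PCR 2 share a region defined by primers SO85 and SO86, which lets them anneal and be joined in PCR 3. The sketch below checks that overlap. It assumes the SO85 sequence with the extraction spaces removed and an SO86 sequence read as the strand complementary to SO85; both readings should be treated as assumptions, and the helper names are illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
CODONS = {"GCC": "A", "AAC": "N", "ATT": "I", "ACA": "T"}  # only the codons needed here


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def longest_overlap(left, right):
    """Longest suffix of `left` that is also a prefix of `right`."""
    for size in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:size]):
            return right[:size]
    return ""


SO85 = "TGGGCATCAGGTGCCAACATTACAGCCCGCCCCTGCATCCCTAAAAGC"  # sense primer (assumed reading)
SO86 = "GCAGGGGCGGGCTGTAATGTTGGCACCTGATGCCCACGACACTGCCTG"  # antisense primer (assumed reading)

overlap = longest_overlap(reverse_complement(SO86), SO85)
insert = "GCCAACATTACA"  # codons carried inside the overlap
peptide = "".join(CODONS[insert[i:i + 3]] for i in range(0, len(insert), 3))

print(len(overlap), insert in overlap, peptide)
```

Under these readings the overlap contains the four codons encoding ANIT, so both half-products carry the peptide addition before they are fused.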

After confirmation of the correct mutations by DNA sequencing the plasmid was transfected 
into insect cells using the Bac-N-Blue™ transfection kit from Invitrogen, Carlsbad, CA, USA. 
Expression of the muteins was tested by western blotting and by activity measurement of the 
muteins using the GCB Activity Assay. 

Enzymatic activity in the PNP assay of wtGCB (SEQ ID NO 1) expressed in the expression 
vector pVL1392 in insect cells (Sf9) using an analogous method to that described in Example 1 gave 
13 units/L, while the N-terminal peptide addition ASPINAT gave 28.5 units/L. 

Construction of libraries of GCB with N-terminal peptide addition 

Using random mutagenesis, two different libraries were constructed on the basis of GCB 
polypeptides with an N-terminal extension: library A with an N-terminal extension encoding the 
following amino acid sequence AXNXTXNXTXNXT, and library B with an N-terminal extension 
encoding ANXTNXTNXT. 

Primers for library A were designed: 
S0167: 5'- 

GTGTCGTGGGCATCAGGTGCC^(G/C)AA(aT)(T/A/G)N(G/C)AC(A/T/C)(T/A/G)N(G/C)A 
A((yT)<T/A/G)N(G/C)AC(A/T^ 
25 CTGCATCCCTAAAAGC 

SO168: 5'-GGCACCTGATGCCCACGACACTGCCTG 

Primers for library B were designed using trinucleotides in the random positions. 
X is a mixture of trinucleotide codons for all natural amino acid residues, except proline. The 
trinucleotide codons used were the same as described by Kayushin et al., Nucleic Acids Research, 
24, 3748-3755, 1996. 

S0165: 5'- 

CGTGGGCAT(^CK3TGCCAAaX)AC(An'/C)AA(OT)(X)AC(An , /C)AA(OT)(X)AC(A^'/C)G 
35 CCCGCCCCTGCATCCCTAAAAGC 

SO166: 5'-GTTGGCACCTGATGCCCACGACACTGCCTG 

For both libraries: 

SO60: 5'-CAGCTGGCCATGGGTACCCGG 
pBR10: 5'-TTT ACT GTT TTC GTA ACA GTT TTG 

In all PCR reactions BIO-X-ACT™ DNA polymerase (Bioline, London, U.K.) and 1x OptiBuffer™ 
(Bioline, London, U.K.) were used. The PCR conditions were 30 cycles of 94°C 30s, 55°C 1 min, 
and 72°C 1 min. 
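The size of the sequence space behind each library template can be estimated by counting the randomized positions. The sketch below does this for library B, where each X is drawn from trinucleotide codons for 19 residues (all natural amino acids except proline); for library A only the number of X positions is counted, because the residues available at each position depend on the degenerate codons in the primer. The function name is illustrative.

```python
def count_variants(template, residues_per_x):
    """Number of distinct peptides if every X is drawn from `residues_per_x` residues."""
    return residues_per_x ** template.count("X")


LIBRARY_A = "AXNXTXNXTXNXT"
LIBRARY_B = "ANXTNXTNXT"

x_positions_a = LIBRARY_A.count("X")
variants_b = count_variants(LIBRARY_B, 19)  # 19 residues: proline excluded

print(x_positions_a, variants_b)
```

Excluding proline at the X between Asn and Thr keeps every N-X-T triplet in library B a potential glycosylation site, since N-P-T does not match the consensus sequon.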

Templates and primers used for preparing a nucleotide sequence encoding the N-terminal extension 
by the above PCR were as follows: 

PCR 1A: 
Template: pGC12 
Primers: SO60 + SO167 

PCR 1B: 
Template: pGC12 
Primers: SO60 + SO165 

PCR 2A: 
Template: pGC12 
Primers: SO168 + pBR10 

PCR 2B: 
Template: pGC12 
Primers: SO166 + pBR10 

PCR 3A: 
Template: 1 µl of agarose gel purified PCR 1A and 2A products 
Primers: SO60 + pBR10 

PCR 3B: 
Template: 1 µl of agarose gel purified PCR 1B and 2B products 
Primers: SO60 + pBR10 

The PCR 3A and 3B products were agarose gel purified, digested with NheI and NcoI, and ligated into pGC-12 
digested with NheI and NcoI. The ligation mixture was transformed into competent E. coli as 
described in Example 2. The diversity of the library was examined by DNA sequencing of different 
E. coli clones and gave rise to the following amino acid sequences: 

35 

Library A: 
1: AFNXTLNKTWN(F/L)T 
2: TMNNTWNWTWNWT 
3: -EXT wt 
4: ALNSTGNLTVDGT 
5: ASNSTFNLTENLT 
6: TRNVTTNCTUNST 
7: -EXT wt 
8: ALNWTYNGTKNVT 
9: AANWTVNFTGNFT 
10: -EXT wt 
11: AXNXTVNSTUNVT 
12: ANNFTFNGTLNLT 
13: AGNWTANVTVNVT 
14: AGNSTSNVTGNWT 
15: AVNSTMN1HAIPP (1 deletion - nonsense) 
16: AGNGTVNGTINGT 
17: AVNSTGNXTGNWT 
18: AGNGTUNGTSNLT 
19: -EXT wt 
20: AMNSTKNSTLNIT 
21: AFNYTSKNST 
22: -EXT wt 
23: AVNATMNWTANGT 
24: ASNSTNNGTLNAT 
25: ARNKTKNFTINLT 
26: APNTTUNDTVNMT 
27: AQNKTFNFTMNCT 
28: ALNVTWNCTLNLT 
29: ALNTTWTNLT 

Library B: 
1: ANTTNFTNET 
2: ANWTNRTNCT 
3: ANWTNFTNWT 
4: PTGLIGTNFT 
5: ANWTNKTNFT 
6: ANNTNLTNAT 
7: ANYTNWTNFT 
8: ANTTNQTNDT 
9: -EXT wt 
10: ANRTNWTNTT 
11: PTATNHTNST 
12: -EXT wt 
13: ANWTNQTNQT 
14: ANWTNWTNAT 
15: ANFTNKTNMT 
16: ANHTNETNAT 
17: AN(C/W)TNFTNET 
18: ANLDKLHKUH (insertion - nonsense) 
19: ANCFTNQTNFT 
20: ANWTNWTNEWT 
21: ANCTNWTNCT 
22: -EXT wt 
23: -EXT wt 
24: CHPYNWTNWT 
25: ANETNYTNET 
26: ANWTNWT 
27: AKPYKSYKFY (insertion - nonsense) 
28: ANITNKTNWT 
29: ANWTNMTNTT 
30: ANNTNRTNPT 
31: ANWTNWTNWT 
32: ANWRTNHTNKT 
33: -EXT wt 
34: ANQTNITNWT 
34: ANQTNITNWT 
Library B was transfected into insect cells using the Bac-N-Blue™ transfection kit from 
Invitrogen, Carlsbad, CA, USA. First, 96 plaques from Library B were picked and tested by activity 
measurement (PNP GCB Activity Assay). Plaques were selected as follows: 3 with high activity, 3 
with medium activity and 3 with low or no activity, and virus was purified for DNA sequencing, 
resulting in the following amino acid sequences: 

High activity: 
1-1: Mixed sequence 
1-2: ANFTNVATNQT 
1-3: (A)(N)TTXLTN(K)T 

Medium activity: 
2-1: ANKTN(S/C)TNIT 
2-2: Mixed sequence 
2-3: ANWTNCTNflOT 

Low activity: 
3-1: ANWTN(F/L)TNWT 
3-2: CQLDURSTNET 
3-3: No sequence 

From both libraries 96 plaques were picked and tested by activity measurement (PNP GCB Activity 
Assay). From each library 6 plaques with high activity were selected and virus was purified for 
DNA sequencing. The amino acid sequences encoded by the different clones were: 

Library A: 
1: Mixed sequence 
2: Mixed sequence 
3: Mixed sequence 
4: WT 
5: ANNTNYTNWT 
6: ANNTNYTNWT 

Library B: 
1: AANDTUNWTVNCT 
2: ATNITLNYTANTT 
3: WT 
4: AANSTGNITINGT 
5: AVNWTSNDTSNST 

The activity of the positives after plaque purification is shown in Table X in Example 6 below. 
EXAMPLE 4 
PRODUCTION OF SAPC 

Expression of a synthetic Sap C gene in E. coli 

A plasmid expression vector for expression of Saposin C with a His-tag was kindly provided by 
Dr. Gregory A. Grabowski, Cincinnati, Ohio. The plasmid is described in Qi et al. J. Biol. Chem. 
269, 16746-16753, 1994, and its expression in the E. coli strain BL21(DE3) was performed as 
described in the same paper. 

Purification 

Cell pellets from E. coli expressing recombinant Saposin C were solubilized in binding buffer (10 
mM Tris, 0.5 M NaCl, 20 mM imidazole, pH 7.9) containing one tablet of "Complete" protease 
inhibitor cocktail (Roche) per 50 ml, and sonicated on ice on a U200S sonicator (DCA) at 80% 
amplitude for 4 times 20 seconds. The sonicate was centrifuged in a Sorvall RC5C centrifuge with an 
SS34 rotor at 12000 rpm for 15 minutes at 4°C. The supernatant was filtered through a 0.45 µm 
filter and applied onto a Ni-loaded HiTrap™ Chelating column (Pharmacia) previously equilibrated 
in binding buffer. The resin was washed with binding buffer until the absorption at 280 nm reached 
baseline levels, and bound protein was eluted using a linear gradient from 0-100% B buffer (10 mM 
Tris, 0.5 M NaCl, 0.5 M imidazole, pH 7.9). Fractions enriched in Saposin C were pooled and 
ammonium sulfate was added to 0.75 M before application onto a Toyopearl Butyl 650S resin 
previously equilibrated in 10 mM Tris pH 7.9, 0.75 M ammonium sulfate. After application, the 
resin was washed in 10 mM Tris pH 7.9, 0.75 M ammonium sulfate until absorption at 280 nm 
reached baseline levels. Bound protein was eluted using a linear gradient from 0-100% B (10 mM 
Tris pH 7.9). Saposin C, eluting around 0.10 M ammonium sulfate, was pooled and the buffer was 
exchanged on a Vivaspin20 (Vivascience) to 50 mM sodium citrate pH 5.8. The protein sample was 
sterile-filtered before storage at -80°C. 

EXAMPLE 5 

Construction of a Saposin C-GCB fusion polypeptide 

Fusion polypeptides of wtSaposin C (SEQ ID NO 3) and wtGCB (SEQ ID NO 1, wherein X is R) 
were constructed using standard cloning methods known in the art by making one nucleotide 
sequence expressing either of the following polypeptides: 

SaposinC-linkerpeptide1-GCB or GCB-linkerpeptide2-SaposinC 

The composition of specific fusion polypeptides (pGC-53, pGC-54, pGC-64, pGC-65 and pGC-73) 
is given in table 3 in Example 6. 

An example of the amino acid sequence of the fusion polypeptide of the type SaposinC-linkerpeptide-GCB 
is shown as SEQ ID NO 4. 



SUBSTITUTE SHEET (RULE 26) 



WO 01/49830 



PCT/DK00/00743 



65 

EXAMPLE 6 

PROPERTIES OF GCB POLYPEPTIDES OF THE INVENTION 

GCB polypeptides of the invention were tested for various properties, including GCB activity, 
stability in J774E cells and uptake in J774E cells. Unless otherwise stated, the properties were tested 
by use of the methods described in the Methods section herein. 
Table 3 below lists the GCB activity of various GCB polypeptides of the invention. 



Plasmid  Vector       Mutations         # Glycosylation   Activity after
                                        sites introduced  plaque isolation (U/L)
pGC-1    pBlueBac4.5  Wt                0                 6
pGC-2    pBlueBac4.5  K194N             1                 16
pGC-3    pBlueBac4.5  K194T             1                 6
pGC-4    pBlueBac4.5  K224N, Q226T
pGC-5    pBlueBac4.5  K293N, V295T      1                 No plaques
pGC-6    pBlueBac4.5  N-term: ANIT      1                 3
pGC-7    pBlueBac4.5  E41N              1                 2
pGC-8    pVL1392      K74N, Q76T        1                 31
pGC-9    pVL1392      A84N              1                 0.05
pGC-10   pBlueBac4.5  K321N             1                 No plaques
pGC-12   pVL1392      Wt                0                 13
pGC-13   pVL1392      N-term: ASPINAT   1                 29
pGC-14   pVL1392      K7N, F9T          1                 0.2
pGC-15   pVL1392      K106, Y108T       1                 0.2
pGC-16   pVL1392      K194N, Q200T      1                 0.4
pGC-17   pVL1392      H206N             1                 0.3
pGC-18   pVL1392      E222N, K224T      1                 6
pGC-19   pVL1392      K303N, V305T      1                 1.5
pGC-21   pVL1392      K293N, V295T      1                 29
pGC-22   pVL1392      K321N             1                 24
pGC-27   pVL1392      T132N             1                 9
Plasmid  Vector   Mutations                                            # Glycosylation
                                                                       sites introduced
pGC-28   pVL1392  I130N                                                1
pGC-36   pVL1392  N-term: ASPINATSPINAT                                2
pGC-37   pVL1392  K194N, K321N                                         2
pGC-38   pVL1392  N-term: ASPINAT, K194N, K321N                        3
pGC-39   pVL1392  T132N, K293N, V295T                                  2
pGC-40   pVL1392  N-term: ASPINAT, T132N, K293N, V295T                 3
pGC-45   pVL1392  N-term: ASPINAT, K194N, E222N, K224T, K321N          4
pGC-47   pVL1392  N-term: AGNGTVNGTINGT                                3
pGC-48   pVL1392  N-term: ASNSTNNGTLNAT                                3
pGC-52   pVL1392  R495H
pGC-53   pVL1392  Saposin C-(GGGGS)3 linker-GCB
pGC-54   pVL1392  GCB-GGGG linker-Saposin C
pGC-56   pVL1392  N-term: ASPINATSPINAT, K194N, K321N                  4
pGC-57   pVL1392  N-term: ASPINAT, T132N, K194N, K321N
pGC-58   pVL1392  N-term: ASPINAT, T132N, K194N
pGC-60   pVL1392  N-term: ANNTNYTNWT
pGC-61   pVL1392  N-term: ATNTTLNYTANTT
pGC-62   pVL1392  N-term: AANSTGNTTINGT
pGC-63   pVL1392  N-term: AVNWTSNDTSNST
pGC-64   pVL1392  GCB-(GGGGS)3 linker-Saposin C
pGC-65   pVL1392  GCB-GNAT linker-Saposin C
pGC-66   pVL1392  Q166N, A168T
pGC-67   pVL1392  D218N, Y220T
pGC-68   pVL1392  AN N-term extension + R2T
pGC-69   pVL1392  K77N, K79T
pGC-70   pVL1392  T132N, K194N, K293N, V295T, K321N                    4
pGC-71   pVL1392  N-term: ASPINAT, T132N, K194N, K293N, V295T, K321N   5
pGC-72   pVL1392  P28N, P29L                                           1
pGC-73   pVL1392  GCB-Sap C (no linker)

Activity after plaque isolation (U/L) for pGC-28 to pGC-73, in the order listed (the assignment of 
individual values to plasmids cannot be recovered from the column): 7; 16; 13; 16; 3; 3.5; 13; 27; 
P2: 14; P2: 38; P2: 35; P2: 66; 67; 54; 79; 37; 17; 13; 16 

Table 3: The plasmid column shows the number of the GCB polypeptide. The vector column shows 
the plasmid vector used for expression of the polypeptide. The mutation column shows the amino 
acid exchanges of the GCB polypeptide. N-terminal extensions are described as N-term followed by 
the amino acid residues that make up the extension. Constructs for expression of fusion proteins of 
Saposin C and GCB are described in the order in which they are fused, together with the amino acid residues 
making up the linker linking the two polypeptides together. The Activity column gives the units per 
liter of GCB activity measured by the GCB Activity Assay (PNP-Glu) on the supernatant from Sf9 
insect cells infected with one single plaque and grown in 3 ml of media in a 6-well plate. Those 
labelled with P2 are activities measured on supernatant from virus-infected cells grown in 15 ml T75 
flasks. 

The uptake and stability of selected GCB polypeptides are shown in Figures 1 and 2, 
respectively. 



Labels     Vmax                    KM
           Y       SD      N       Y         SD       N
WT         0.572   0.101   3       87.680    3.211    3
Cerezyme   0.518   0.144   2       91.915    2.666    2
pGC36      0.599   0.010   2       70.590    12.557   2
pGC37      0.449   0.000   1       36.300    0.000    1
pGC38      0.478   0.000   1       43.980    0.000    1
pGC45      0.371   0.000   1       27.520    0.000    1
pGC54      0.871   0.139   3       79.073    6.450    3
pGC56      0.392   0.000   1       32.170    0.000    1
pGC59      0.362   0.000   1       30.900    0.000    1
pGC60      0.566   0.156   2       79.133    14.030   3
pGC61      0.738   0.105   2       100.510   16.674   2
pGC62      0.860   0.000   1       110.800   0.000    1
pGC63      0.513   0.100   2       83.105    6.456    2

Table 4: Calculated Vmax and KM for the different GCB 
polypeptides. Vmax and KM were calculated from dose-response 
curves (see figure 1). 



For the dose-response curves (figure 1), a Vmax and a KM for uptake were calculated for each of the 
selected GCB polypeptides (see table 4, wherein Y is the actual value, SD the standard deviation and 
N the number of assays). As can be seen from table 4, an increase in Vmax was observed for the 
fusion protein (pGC54) and for the N-terminally extended GCB polypeptides (pGC60, pGC61, 
pGC62, and pGC63) while the KM was unchanged. 
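Uptake parameters of this kind are commonly obtained by fitting a saturable, Michaelis-Menten type curve, v = Vmax * S / (KM + S), to the dose-response data. The sketch below is illustrative only: it uses the Hanes-Woolf linearization (S/v plotted against S) instead of the non-linear regression normally used for such fits, and the doses and parameter values are hypothetical, not values from table 4.

```python
def michaelis_menten(conc, vmax, km):
    """Saturable uptake rate at a given dose."""
    return vmax * conc / (km + conc)


def fit_hanes_woolf(concs, rates):
    """Estimate (Vmax, KM) from the line S/v = S/Vmax + KM/Vmax."""
    ys = [s / v for s, v in zip(concs, rates)]
    n = len(concs)
    mean_s = sum(concs) / n
    mean_y = sum(ys) / n
    sxx = sum((s - mean_s) ** 2 for s in concs)
    sxy = sum((s - mean_s) * (y - mean_y) for s, y in zip(concs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_s
    vmax = 1.0 / slope
    return vmax, intercept * vmax


# Hypothetical, noise-free dose-response data generated with Vmax = 1.0 and KM = 50.0.
doses = [10.0, 25.0, 50.0, 100.0, 200.0]
uptake = [michaelis_menten(s, 1.0, 50.0) for s in doses]
vmax_fit, km_fit = fit_hanes_woolf(doses, uptake)
print(f"Vmax = {vmax_fit:.3f}, KM = {km_fit:.1f}")
```

With real, noisy measurements a non-linear fit is generally preferable, because the linearization distorts the error structure of the data.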
Furthermore, the muteins were also tested for their stability in J774E cells (figure 2) and a 
half-life was calculated to be between 50 and 100 sec. 

Activation of the different GCB polypeptides by phosphatidyl serine from bovine brain was 
also tested and a KD was calculated. As can be seen in figure 3, the GCB-saposin C fusion protein 
(pGC54) was far more active compared to Cerezyme and the WT GCB polypeptide (a 6.8 and 5.2 
fold change in KD, respectively). 

The ability of saposin C to activate a set amount of the different GCB polypeptides was 
also tested in the presence of 5 µg/ml phosphatidyl serine. As can be seen in figure 4A, the basal 
activity of the fusion protein (pGC54) was higher compared to the WT polypeptide and Cerezyme. 

10 

EXAMPLE 7 

PEGylation of GCB polypeptides 

GCB polypeptides were PEGylated using activated PEG-succinimidyl propionate (SPA-PEG) 
(Shearwater) in a buffer containing 0.1 M sodium phosphate pH 7.0. PEG was present in 5-120-fold 
molar excess with respect to the lysines, and protein concentration was 0.8-1.3 mg/ml. The reaction 
was carried out in 50-120 µl batches at room temperature for 1 hour with agitation, and quenched 
using a 20-fold excess of glycine. Following the conjugation reaction, excess glycine and PEG were 
removed by dialysis. 

Using the above method rGCB was conjugated with activated SPA-PEG (Mw 5000 Da). 
rGCB in 50 mM sodium citrate, 0.2 M mannitol, 0.09% Tween 80, pH 6.1 was dialyzed against 0.1 M 
sodium phosphate buffer solution, pH 7.0, using a Vivaspin 500 (Vivascience), resulting in a final 
GCB concentration of 1.7 mg/ml. 25 µl SPA-PEG was solubilized in 0.1 M sodium phosphate 
buffer solution pH 7.0 to a concentration of 88 mg/ml and immediately added to an equal volume of 
the enzyme solution, giving a 20-fold excess of PEG with respect to lysines. The reaction was 
incubated at room temperature for 1 hour with agitation. The reaction was quenched by adding a 20-fold 
molar excess of glycine. The modification was checked by SDS PAGE and the enzyme activity 
was measured using the artificial substrate PNP-glucopyranoside. SDS PAGE showed a number 
of discrete bands, each representing a pegylated GCB species. The major bands corresponded to a 
GCB molecule with 6-8 conjugated PEG molecules (Figure 6). The activity assays revealed that 
approximately 80% of the GCB activity was retained. The uptake of PEGylated GCB polypeptides 
was assayed using the J774E in vivo uptake assay. The result is shown in Fig. 7. It is evident that 
when 1-4 PEG molecules are attached to GCB, uptake is comparable to wildtype. 
EXAMPLE 8 

N-GLYCAN STRUCTURES IN WTGCB EXPRESSED IN INSECT CELLS 

Approximately 350 ng of purified wtGCB expressed in Sf9 cells were dried in a SpeedVac 
concentrator, dissolved in 400 µl 6 M guanidinium, 0.3 M Tris-HCl, pH 8.3 and denatured overnight 
at 37°C. Following denaturation, the disulfide bonds in the protein were reduced by addition of 50 
µl 0.1 M DTT in 6 M guanidinium, 0.3 M Tris-HCl, pH 8.3. After 2 h of incubation at ambient 
temperature the thiol groups present were alkylated by addition of 50 µl 0.6 M iodoacetamide in 6 M 
guanidinium, 0.3 M Tris-HCl, pH 8.3. Alkylation took place for 30 min at ambient temperature 
before the reduced and alkylated protein was buffer-exchanged into 50 mM NH4HCO3 using a NAP-5 
column. The volume of the sample was reduced to approximately 200 µl in a SpeedVac concentrator 
before addition of 10 µg trypsin. Trypsin degradation was carried out for 16 h at 37°C. The resulting 
peptides were separated by reversed phase HPLC employing a Phenomenex Jupiter C18 column (0.2 
x 5 cm) eluted with a linear gradient of acetonitrile in 0.1% aqueous TFA. The collected fractions 
were analysed by MALDI-TOF mass spectrometry before re-purification. Subsequently selected 
peptides were subjected to N-terminal amino acid sequence analysis. 

445 amino acid residues out of 497 (90%) were verified in the GCB sequence either through 
direct identification using chemical sequencing or through indirect mass identification of peptides 
using MALDI-TOF mass spectrometry. This is summarised in Table 5. 
                        0
  1  ARPCIPKSFG YSSVVCVCNA TYCDSFDPPT FPALGTFSRY ESTRSGRRME   50
             0
 51  LSMGPIQANH TGTGLLLTLQ PEQKFQKVKG FGGAMTDAAA LNILALSPPA  100
                                                      0
101  QNLLLKSYFS EEGIGYNIIR VPMASCDFSI RTYTYADTPD DFQLHNFSLP  150
151  EEDTKLKIPL IHRALQLAQR PVSLLASPWT SPTWLKTNGA VNGKGSLKGQ  200
201  PGDIYHQTWA RYFVKFLDAY AEHKLQFWAV TAENEPSAGL LSGYPFQCLG  250
                         0
251  FTPEHQRDFI ARDLGPTLAN STHHNVRLLM LDDQRLLLPH WAKVVLTDPE  300
301  AAKYVHGIAV HWYLDFLAPA KATLGETHRL FPNTMLFASE ACVGSKFWEQ  350
351  SVRLGSWDRG MQYSHSIITN LLYHVVGWTD WNLALNPEGG PNWVRNFVDS  400
401  PIIVDITKDT FYKQPMFYHL GHFSKFIPEG SQRVGLVASQ KNDLDAVALM  450
                 #
451  HPDGSAVVVV LNRSSKDVPL TIKDPAVGFL ETISPGYSIH TYLWRRQ     497

Table 5

The amino acid sequence of wtGCB. Amino acid residues shown in italics are verified through mass
identification of a peptide while amino acid residues in bold italics are verified through chemical
sequence determination. 0 designates the four used N-glycosylation sites while # designates the
potential N-glycosylation site that is not used.

The amino acid sequence of GCB contains five potential N-glycosylation sites at Asn19,
Asn59, Asn146, Asn270, and Asn462. The N-glycosylation site at Asn462 is not used in GCB
expressed in CHO cells. Four glycosylated peptides were identified using combined data from
MALDI-TOF mass spectrometry and N-terminal amino acid sequencing and purified. Each of these
four peptides contains a single N-glycosylation site at Asn19, Asn59, Asn146, and Asn270,
respectively. The peptide containing the potential N-glycosylation site at Asn462 was purified and
the combined data from MALDI-TOF mass spectrometry and N-terminal amino acid sequencing
showed Asn462 to be unoccupied in GCB expressed in Sf9 cells as in CHO cells.

For the peptide containing Asn19 (amino acid residues 8-39) the theoretical mass -
including the three S-carboxamido-groups on Cys-residues 4, 16, and 18 - is 3608.57 Da. The
peptide containing Asn19 - identified through N-terminal amino acid sequence determination - gave
experimental masses of 4501.97 Da and 4341.11 Da in MALDI-TOF mass spectrometry. The mass
differences between the theoretical mass and the experimental masses are thus 893.40 Da and
732.54 Da. The mass differences correspond to Man3GlcNAc2 (892.31 Da) and Man2GlcNAc2
(730.26 Da) carbohydrate structures.
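
The assignment above amounts to subtracting the theoretical peptide mass from each experimental mass and matching the difference against candidate glycan masses. A minimal Python sketch of that arithmetic follows. The function name `assign_glycan`, the candidate table and the 5 Da matching tolerance are illustrative assumptions and not part of the described method; the glycan masses are simply those quoted in this example.

```python
# Candidate N-glycan masses in Da, as quoted in this example.
GLYCAN_MASSES = {
    "Man3GlcNAc2Fuc": 1038.38,
    "Man3GlcNAc2": 892.31,
    "Man2GlcNAc2Fuc": 876.33,
    "Man2GlcNAc2": 730.26,
}


def assign_glycan(theoretical_peptide_mass, experimental_mass, tolerance=5.0):
    """Return (mass difference, closest glycan name or None).

    The tolerance (in Da) is an assumed cut-off chosen for illustration only.
    """
    diff = round(experimental_mass - theoretical_peptide_mass, 2)
    name, mass = min(GLYCAN_MASSES.items(), key=lambda item: abs(item[1] - diff))
    if abs(mass - diff) > tolerance:
        return diff, None  # no candidate is close enough
    return diff, name


if __name__ == "__main__":
    # Peptide 8-39 (Asn19): theoretical peptide mass 3608.57 Da.
    for observed in (4501.97, 4341.11):
        print(observed, assign_glycan(3608.57, observed))
```

A difference that falls outside the tolerance is reported without a name rather than being forced onto the nearest candidate.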

Analogously, the peptides containing Asn59 (amino acids 48-74), Asnl46 (amino acids 
132-155), and Asn270 (amino acids 263-277) were analysed and the attached carbohydrate 
structures suggested. The results are summarised in Table 6. 

Table 6

Summary of MALDI-TOF mass spectrometry of the glycosylated wtGCB peptides. The masses
given for the peptide comprising amino acid residues 8-39 include the mass of the
S-carboxamido-groups on Cys-residues 4, 16, and 18.

Amino acid   Theoretical    Experimental    Mass differences  Suggested carbohydrate
residue no.  peptide mass   masses                            structures and their masses

8-39         3608.57 Da     4501.97 Da      893.40 Da         Man3GlcNAc2; 892.31 Da
                            4341.11 Da      732.54 Da         Man2GlcNAc2; 730.26 Da

48-74        2962.54 Da     4001.24 Da      1038.70 Da        Man3GlcNAc2Fuc; 1038.38 Da
                            3855.97 Da      893.43 Da         Man3GlcNAc2; 892.31 Da

132-155      2846.26 Da     3887.95 Da      1041.69 Da        Man3GlcNAc2Fuc; 1038.38 Da
                            3740.16 Da      893.90 Da         Man3GlcNAc2; 892.31 Da

263-277      1630.82 Da     2666.85 Da      1036.03 Da        Man3GlcNAc2Fuc; 1038.38 Da
                            2504.73 Da      873.91 Da         Man2GlcNAc2Fuc; 876.33 Da


The different carbohydrate structures were further characterised by subjecting the four 
peptides carrying carbohydrate to sequential exo-glycosidase treatments in combination with mass 
determinations. 

Below, the typical N-glycan structure found on glycoproteins expressed in Sf9 cells is
shown. The fucose residue (Fuc) linkage is normally α1,6, but can also be α1,3 (indicated by "?").

Man α1,6               Fuc α1,6
    |                     | ?
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
    |                     | ?
Man α1,3               Fuc α1,3

The sequential exo-glycosidase treatments consisted of overnight incubations at 37°C with
the following enzymes: α(1-2,3,4)mannosidase, β(1-4)mannosidase, α(1-6)fucosidase, and
N-glycosidase A. Between each enzyme treatment the mass of the peptides was determined using
MALDI-TOF mass spectrometry.

Following the treatments with α(1-2,3,4)mannosidase and β(1-4)mannosidase it was still
possible to obtain reasonable mass spectra of the peptides. However, the treatment with
α(1-6)fucosidase introduced a significant amount of low molecular mass contaminants in the peptide
samples and it was only possible to obtain data for the carbohydrate structure on Asn270. The same
problem was also observed for the subsequent treatment with N-glycosidase A.
The results are summarised in Table 7.
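
The theoretical masses expected after each treatment follow from removing the corresponding residues step by step. The Python sketch below illustrates this bookkeeping under stated assumptions: the class `Glycopeptide`, the function `sequential_digest` and the residue increments (162.05 Da per mannose, 146.06 Da per fucose, 203.085 Da per GlcNAc) are not given in the text but are inferred from the differences between the theoretical masses in this example, so the results agree with the tabulated values only to within about 0.01 Da.

```python
from dataclasses import dataclass, replace

# Residue mass increments in Da (assumed, inferred from the theoretical masses).
MAN, FUC, GLCNAC = 162.05, 146.06, 203.085


@dataclass(frozen=True)
class Glycopeptide:
    peptide_mass: float  # theoretical mass of the unglycosylated peptide
    alpha_man: int       # alpha-linked antennary mannose residues
    beta_man: int        # core beta(1-4)-linked mannose residue (0 or 1)
    fuc: int             # core fucose residues
    glcnac: int          # core GlcNAc residues

    def mass(self) -> float:
        glycan = (self.alpha_man + self.beta_man) * MAN
        glycan += self.fuc * FUC + self.glcnac * GLCNAC
        return round(self.peptide_mass + glycan, 2)


def sequential_digest(gp: Glycopeptide):
    """List (treatment, theoretical mass) pairs in the order of the treatments."""
    steps = [("untreated", gp.mass())]
    gp = replace(gp, alpha_man=0)
    steps.append(("alpha(1-2,3,4)mannosidase", gp.mass()))
    gp = replace(gp, beta_man=0)
    steps.append(("beta(1-4)mannosidase", gp.mass()))
    gp = replace(gp, fuc=0)
    steps.append(("alpha(1-6)fucosidase", gp.mass()))
    gp = replace(gp, glcnac=0)
    steps.append(("N-glycosidase A", gp.mass()))
    return steps


if __name__ == "__main__":
    # Asn19 glycopeptide carrying Man3GlcNAc2 (peptide mass 3608.57 Da).
    for treatment, mass in sequential_digest(Glycopeptide(3608.57, 2, 1, 0, 2)):
        print(f"{treatment}: {mass:.2f} Da")
```

The sketch treats the N-glycosidase A product as the bare peptide, as the theoretical masses in this example do.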

In general, the results obtained are in accordance with the glycostructure shown above with 
the following specific positional details. 

Asn19:

Man α1,6
    |
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
    |
Man α1,3

and

Man α1,6
    |
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn

or

   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
    |
Man α1,3

Asn59:

Man α1,6               Fuc α1,6
    |                     |
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
    |
Man α1,3

or

Man α1,6
    |
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
    |                     |
Man α1,3               Fuc α1,3

and

Man α1,6
    |
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
    |
Man α1,3

Asn146:

Man α1,6               Fuc α1,6
    |                     |
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
    |
Man α1,3

or

Man α1,6
    |
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
    |                     |
Man α1,3               Fuc α1,3

and

Man α1,6
    |
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
    |
Man α1,3

Asn270:

Man α1,6               Fuc α1,6
    |                     |
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
    |
Man α1,3

or

Man α1,6
    |
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
    |                     |
Man α1,3               Fuc α1,3

and

Man α1,6               Fuc α1,6
    |                     |
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn

or

                       Fuc α1,6
                          |
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
    |
Man α1,3

or

Man α1,6
    |
   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
                          |
                       Fuc α1,3

or

   Man β1,4-GlcNAc β1,4-GlcNAc β-Asn
    |                     |
Man α1,3               Fuc α1,3

Table 7. Summary of the data obtained from exoglycosidase treatments of GCB
glycopeptides. N.D., not determined.

Position   Suggested carbohydrate     Suggested carbohydrate structures,   Suggested carbohydrate structures,
           structures and theoretical theoretical (T) and experimental     theoretical (T) and experimental
           glycopeptide masses        (E) glycopeptide masses after        (E) glycopeptide masses after
                                      treatment with α(1-2,3,4)mannosidase treatment with β(1-4)mannosidase

Asn19      Man3GlcNAc2; 4500.89 Da    ManGlcNAc2                           GlcNAc2
           Man2GlcNAc2; [illegible]   T: 4176.79 Da; E: 4176.05 Da         T: 4014.74 Da; E: 4019.27 Da

Asn59      Man3GlcNAc2Fuc; 4000.93 Da ManGlcNAc2Fuc                        GlcNAc2Fuc
                                      T: 3676.83 Da; E: 3674.87 Da         T: 3514.78 Da; E: 3511.36 Da
           Man3GlcNAc2; 3854.87 Da    ManGlcNAc2                           GlcNAc2
                                      T: 3530.77 Da; E: N.D.               T: 3368.72 Da; E: N.D.

Asn146     Man3GlcNAc2Fuc; 3884.64 Da ManGlcNAc2Fuc                        GlcNAc2Fuc
                                      T: 3560.54 Da; E: 3557.30 Da         T: 3398.49 Da; E: 3396.21 Da
           Man3GlcNAc2; 3738.57 Da    ManGlcNAc2                           GlcNAc2
                                      T: 3414.48 Da; E: 3413.26 Da         T: 3252.43 Da; E: 3252.42 Da

Asn270     Man3GlcNAc2Fuc; 2669.20 Da ManGlcNAc2Fuc                        GlcNAc2Fuc
           Man2GlcNAc2Fuc; 2507.10 Da T: 2345.1 Da; E: 2345.80 Da          T: 2183.05 Da; E: 2183.10 Da

Table 7 (continued).

Position   Suggested carbohydrate structures,      Theoretical (T) and experimental (E)
           theoretical (T) and experimental (E)    peptide mass after treatment with
           glycopeptide masses after treatment     N-glycosidase A
           with α(1-6)fucosidase

Asn19      GlcNAc2                                 T: 3608.57 Da; E: N.D.
           T: 4014.74 Da; E: N.D.

Asn59      GlcNAc2                                 T: 2962.54 Da; E: N.D.
           T: 3368.72 Da; E: N.D.

Asn146     GlcNAc2                                 T: 2846.26 Da; E: N.D.
           T: 3252.43 Da; E: N.D.

Asn270     GlcNAc2                                 T: 1630.82 Da; E: 1631.44 Da
           T: 2036.99 Da;
           E: 2036.59 Da/2183.64 Da

Glycosylation of GCB polypeptides of the invention expressed in insect cells

MALDI-TOF mass spectrometry was used to investigate the amount of carbohydrate 
attached to GCB polypeptides expressed in Sf9 cells. 

The 7 GCB polypeptide variants investigated all contained additional potential N- 
glycosylation sites compared to wtGCB. 

WtGCB contains 5 potential N-glycosylation sites of which only 4 are used. 

The 7 GCB polypeptide variants were: 
GC-36: ASPINATSPINAT-GCB, 
GC-38: ASPINAT-GCB(K194N,K321N),
GC-60: ANNTNYTNWT-GCB,
GC-61: ATNITLNYTANTT-GCB,
GC-62: AANSTGNTTINGT-GCB,
GC-63: AVNWTSNDTSNST-GCB, and 
GC-54: GCB-GGGG-Saposin C. 
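
The potential N-glycosylation sites in the N-terminal extensions listed above can be located by scanning for the consensus sequon Asn-X-Ser/Thr, where X is any residue except proline. The following Python sketch shows one way to do this; the function name `sequon_positions` is hypothetical, positions are 1-based within the peptide given, and the scan covers only the extension itself, so a sequon completed by residues of the GCB part would need the joined sequence as input.

```python
import re


def sequon_positions(peptide):
    """Return 1-based positions of Asn residues in N-X-S/T sequons (X is not Pro)."""
    # A lookahead is used so that overlapping candidates such as "NNT" are all tested.
    return [match.start() + 1 for match in re.finditer(r"(?=N[^P][ST])", peptide)]


if __name__ == "__main__":
    for name, extension in (("GC-36", "ASPINATSPINAT"), ("GC-60", "ANNTNYTNWT")):
        print(name, sequon_positions(extension))
```

Whether a potential site is actually occupied has to be determined experimentally, as described for each variant below.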

WtGCB: 

The theoretical peptide mass of WtGCB is 55 591 Da. WtGCB has 5 potential N-glycosylation sites
of which only 4 are used. As the two most common N-glycan structures on recombinant proteins
expressed in Sf9 cells are Man3GlcNAc2Fuc and Man3GlcNAc2 having masses of 1038.38 Da and
892.31 Da, respectively, the expected mass of WtGCB carrying 4 N-glycans is between 59 159 Da
and 59 743 Da.

MALDI-TOF mass spectrometry of wtGCB shows the broad peak typical of glycoproteins 
with a peak mass of 59.3 kDa in accordance with the expected mass of wtGCB carrying 4 N- 
glycans. 

GC-36 (ASPINATSPINAT-GCB): 

The theoretical peptide mass of GC-36 is 56 829 Da. The N-terminal extension contains two
additional potential glycosylation sites at N5 and N11 compared to wtGCB. Assuming that the
wtGCB part of the variant is glycosylated like wtGCB, the variant has 6 potential N-glycosylation
sites.

As the two most common N-glycan structures on recombinant proteins expressed in Sf9
cells are Man3GlcNAc2Fuc and Man3GlcNAc2 having masses of 1038.38 Da and 892.31 Da,
respectively, the expected mass of GC-36 carrying 4 N-glycans is between 60 397 Da and 60 981
Da, the expected mass of GC-36 carrying 5 N-glycans is between 61 289 Da and 62 019 Da, and the
expected mass of GC-36 carrying 6 N-glycans is between 62 181 Da and 63 057 Da.

MALDI-TOF mass spectrometry of GC-36 shows a rather broad peak with a peak mass
between 61.5 kDa and 62.9 kDa in accordance with the expected mass of GC-36 carrying either 5 or
6 N-glycans.
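
The expected mass ranges used here and for the other variants can be reproduced with a few lines of arithmetic. The Python sketch below is illustrative only: the function names are hypothetical, and the use of the integer glycan masses 892 Da and 1038 Da is an assumption inferred from the fact that these values reproduce the ranges quoted in the text.

```python
# Nominal glycan masses in Da (assumed rounding of 892.31 Da and 1038.38 Da).
MAN3GLCNAC2 = 892
MAN3GLCNAC2FUC = 1038


def expected_mass_range(peptide_mass, n_glycans):
    """Lowest and highest expected mass for a polypeptide carrying n_glycans N-glycans."""
    return (peptide_mass + n_glycans * MAN3GLCNAC2,
            peptide_mass + n_glycans * MAN3GLCNAC2FUC)


def glycan_counts_matching(peptide_mass, peak_mass, max_sites):
    """Numbers of N-glycans whose expected mass range contains the observed peak mass."""
    matches = []
    for n in range(max_sites + 1):
        low, high = expected_mass_range(peptide_mass, n)
        if low <= peak_mass <= high:
            matches.append(n)
    return matches


if __name__ == "__main__":
    # GC-36: theoretical peptide mass 56 829 Da, 6 potential sites.
    for n in (4, 5, 6):
        print(n, expected_mass_range(56829, n))
    print(glycan_counts_matching(56829, 61500, 6), glycan_counts_matching(56829, 62900, 6))
```

Because MALDI-TOF peaks of glycoproteins are broad, a match of this kind indicates only the most likely number of occupied sites.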

N-terminal amino acid sequence analysis of GC-36 showed that N5 is completely
glycosylated while N11 is partially glycosylated, in complete agreement with the result obtained
using mass spectrometry.

GC-38 (ASPINAT-GCB(K194N,K321N)): 

The theoretical peptide mass of GC-38 is 56 217 Da. The N-terminal extension contains one
additional potential glycosylation site at N5 compared to wtGCB. In addition, the substitutions of
Lys194 and Lys321 with Asn-residues introduce two additional potential N-glycosylation sites.
Assuming that the wtGCB part of the variant is glycosylated like wtGCB, the variant has 7
potential N-glycosylation sites.

Based on the same considerations as those used for GC-36, the expected mass of GC-38 
carrying 4 N-glycans is between 59 785 Da and 60 369 Da, the expected mass of GC-38 carrying 5 
N-glycans is between 60 677 Da and 61 407 Da, the expected mass of GC-38 carrying 6 N-glycans 
is between 61 569 Da and 62 445 Da, and the expected mass of GC-38 carrying 7 N-glycans is 
between 62 461 Da and 63 483 Da. 

MALDI-TOF mass spectrometry of GC-38 shows a major peak with a peak mass of 63.1 
kDa in accordance with the expected mass of GC-38 carrying 7 N-glycans. In addition, a minor peak 
with a peak mass of 62.3 kDa is seen which corresponds to GC-38 carrying 6 N-glycans. 

N-terminal amino acid sequence analysis of GC-38 showed that N5 is completely
glycosylated.

GC-60 (ANNTNYTNWT-GCB) : 

The theoretical peptide mass of GC-60 is 56 770 Da. The N-terminal extension contains three 
additional potential glycosylation sites at N2, N5 and N8 compared to wtGCB. Assuming that the 
wtGCB part of the variant is glycosylated like wtGCB, the variant has 7 potential N-glycosylation 
sites. 

Based on the same considerations as those used for GC-36 the expected mass of GC-60 
carrying 4 N-glycans is between 60 338 Da and 60 922 Da, the expected mass of GC-60 carrying 5 
N-glycans is between 61 230 Da and 61 960 Da, the expected mass of GC-60 carrying 6 N-glycans 
is between 62 122 Da and 62 998 Da, and the expected mass of GC-60 carrying 7 N-glycans is 
between 63 014 Da and 64 036 Da.

MALDI-TOF mass spectrometry of GC-60 shows two broad peaks with peak masses of
61.9 kDa and 62.8 kDa in accordance with the expected mass of GC-60 carrying either 5 or 6
N-glycans.

N-terminal amino acid sequence analysis of GC-60 showed that N2 is mainly glycosylated,
N5 is completely glycosylated while N8 is only seldom glycosylated, in acceptable agreement with
the result obtained using mass spectrometry.

GC-61 (ATNITLNYTANTT-GCB):

The theoretical peptide mass of GC-61 is 56 970 Da. The N-terminal extension contains three
additional potential glycosylation sites at N3, N7 and N11 compared to wtGCB. Assuming that the
wtGCB part of the variant is glycosylated like wtGCB, the variant has 7 potential N-glycosylation
sites.

Based on the same considerations as used for GC-36, the expected mass of GC-61 carrying 
4 N-glycans is between 60 538 Da and 61 122 Da, the expected mass of GC-61 carrying 5 N- 
glycans is between 61 430 Da and 62 160 Da, the expected mass of GC-61 carrying 6 N-glycans is 
between 62 322 Da and 63 198 Da, and the expected mass of GC-61 carrying 7 N-glycans is 
between 63 214 Da and 64 236 Da. 

MALDI-TOF mass spectrometry of GC-61 shows a very broad peak with peak mass 
between 61.5 kDa and 63.0 kDa in accordance with the expected mass of GC-61 carrying either 5 or 
6 N-glycans. 

N-terminal amino acid sequence analysis of GC-61 showed that N3 is completely
glycosylated while N7 and N11 are partially glycosylated, in acceptable agreement with the result
obtained using mass spectrometry.

GC-62 (AANSTGNTTINGT-GCB): 

The theoretical peptide mass of GC-62 is 56 806 Da. The N-terminal extension contains three
additional potential glycosylation sites at N3, N7 and N11 compared to wtGCB. Assuming that the
wtGCB part of the variant is glycosylated like wtGCB, the variant has 7 potential N-glycosylation
sites.

Based on the same considerations as those used for GC-36, the expected mass of GC-62 carrying 4 
N-glycans is between 60 374 Da and 60 958 Da, the expected mass of GC-62 carrying 5 N-glycans 
is between 61 266 Da and 61 996 Da, the expected mass of GC-62 carrying 6 N-glycans is between 
62 158 Da and 63 034 Da, and the expected mass of GC-62 carrying 7 N-glycans is between 63 050 
Da and 64 072 Da.

MALDI-TOF mass spectrometry of GC-62 shows two broad peaks with peak masses of
61.6 kDa and 62.7 kDa in accordance with the expected mass of GC-62 carrying either 5 or 6
N-glycans.

N-terminal amino acid sequence analysis of GC-62 showed that N3 is completely
glycosylated while N7 and N11 are partially glycosylated, in acceptable agreement with the result
obtained using mass spectrometry.

GC-63 (AVNWTSNDTSNST-GCB): 

The theoretical peptide mass of GC-63 is 56 969 Da. The N-terminal extension contains three
additional potential glycosylation sites at N3, N7 and N11 compared to wtGCB. Assuming that the
wtGCB part of the variant is glycosylated like wtGCB, the variant has 7 potential N-glycosylation
sites.

Based on the same considerations as those used for GC-36, the expected mass of GC-63 
carrying 4 N-glycans is between 60 537 Da and 61 121 Da, the expected mass of GC-63 carrying 5 
N-glycans is between 61 429 Da and 62 159 Da, the expected mass of GC-63 carrying 6 N-glycans 
is between 62 321 Da and 63 197 Da, and the expected mass of GC-63 carrying 7 N-glycans is 
between 63 213 Da and 64 235 Da. 

MALDI-TOF mass spectrometry of GC-63 shows a major peak with a peak mass of 61.9 
kDa in accordance with the expected mass of GC-63 carrying 5 N-glycans. In addition, a minor peak 
with a peak mass of 62.9 kDa is seen which corresponds to GC-63 carrying 6 N-glycans. 

N-terminal amino acid sequence analysis of GC-63 showed that N3 and N7 are partially
glycosylated. It was not possible to evaluate the glycosylation status of N11.

GC-54 (GCB-GGGG-Saposin C): 

The theoretical peptide mass of GC-54 is 64 711 Da. The C-terminal saposin C extension contains
one additional potential glycosylation site compared to wtGCB. Assuming that the wtGCB part of
the variant is glycosylated like wtGCB, the variant has 5 potential N-glycosylation sites.

Based on the same considerations as those used for GC-36, the expected mass of GC-54 carrying 4
N-glycans is between 68 279 Da and 68 863 Da while the expected mass of GC-54 carrying 5
N-glycans is between 69 171 Da and 69 901 Da.

MALDI-TOF mass spectrometry of GC-54 shows a rather broad peak with a peak mass of 
68.4 kDa in accordance with the expected mass of GC-54 carrying 4 N-glycans. Thus, the N- 
glycosylation site in the saposin C extension is probably not used. 

Furthermore, insect cell expressed N-terminally extended glycosylated polypeptides (GC-6
and GC-13) were subjected to N-terminal amino acid sequence analysis (using Procise from PE
Biosystems, Foster City, CA). The sequencing cycle was blank for the Asn residue in both ANIT
and ASPINAT N-terminal peptide additions, demonstrating that the introduced glycosylation site is
glycosylated.

When subjecting GC-13 to mass spectrometry using the MALDI-TOF technique on the
Voyager DE-RP instrument (from PE Biosystems, Foster City, CA) the following results were
obtained:

The wildtype and ASPINAT-extended wildtype expressed in insect cells gave average
masses very close to the calculated mass of 59,727 Da and 61,421 Da, respectively, assuming that
four glycosylation sites were occupied by the carbohydrates FucGlcNAc2Man3.

EXAMPLE 9 

Expression of GCB in CHO lec1

The wtGCB-cDNA was isolated from pGC12 by digestion with NheI and XbaI, and cloned into
pcDNA3.1/Hygro+ (Invitrogen, Carlsbad, CA, USA) digested with NheI and XbaI. The resulting
plasmid was then transfected into CHO lec1 cells (mutant clonal derivative of Chinese hamster
ovary CHO clone pro-5) (available from the American Type Culture Collection, 10801 University
Boulevard, Manassas, VA 20110-2209, USA, item number CRL-1735) using Lipofectamine 2000
(Cat no. 11668-019 Gibco BRL, Life Technologies). The day after transfection GCB activity in the
transfection medium and the cells was measured, using the PNP GCB Activity Assay, with the
following result: Medium: 0.03 U/L; Cells: 2.99 U/L.

The medium was then replaced with a selective medium: DMEM/F12 (Cat no. 21041-025
Gibco BRL, Life Technologies) + 10% FBS (Fetal Bovine Serum, Cat no. 02-701F Bio-Whittaker
Europe, B-4800 Verviers, Belgium) + 100 U/ml Penicillin/100 µg/ml Streptomycin (Cat no.
DE17-602E Bio-Whittaker Europe, B-4800 Verviers, Belgium) + 400 µg/ml Hygromycin (Hygromycin B in
PBS 50 mg/ml, Cat no. 10687-010 Gibco BRL, Life Technologies). When cells were 100%
confluent in the selective medium, the GCB activity in the medium and the cells was measured as
above resulting in the following activities: Medium: 0.05 U/L; Cells: 1.49 U/L.

Independent clones were selected in microtiter plates and 30 clones which grew in the
selective medium were measured in the GCB Activity Assay. Three high-producing clones were
selected for growth in T flasks. By lowering the pH of the medium to 6.5 and adding DTT to a
molar concentration of 0.2 to 1.0 mM a relatively high amount of GCB is secreted with an
N-glycosylation structure believed to comprise 5 exposed mannose residues (a similar glycosylation
structure was described for the G glycoprotein of vesicular stomatitis virus expressed in the same
cell line as described in Robertson et al., Cell 13, pp. 515-526, 1978).

CLAIMS

1. A polypeptide selected from the group of lysosomal enzymes and lysosomal enzyme activators, 
comprising at least one introduced glycosylation site as compared to a corresponding parent enzyme 
or activator. 

2. The polypeptide according to claim 1, wherein the glycosylation site(s) is/are introduced 
into the amino acid sequence of the mature form of the parent lysosomal enzyme or activator. 

3. The polypeptide according to claim 2, wherein the glycosylation site is introduced into a 
surface exposed position of the parent enzyme or activator. 

4. The polypeptide according to any of claims 1-3, wherein the glycosylation site is
introduced into a position of the parent enzyme or activator that is occupied by a charged amino acid
residue, in particular an amino acid residue selected from the group consisting of E, D, R, K and H,
or a position that is located between position -4 and +4 relative to a lysine residue.

5. The polypeptide according to any of claims 2-4, comprising at least 2-10 introduced 
glycosylation sites. 

6. The polypeptide according to any of claims 2-5, lacking at least one glycosylation site 
present in the parent enzyme or activator. 

7. The polypeptide according to any of claims 1-6, wherein the lysosomal enzyme or
activator comprises an N-terminal or C-terminal peptide addition as compared to the corresponding
parent enzyme or activator, the peptide addition comprising or contributing to at least one
glycosylation site.

8. The polypeptide according to claim 7, wherein the peptide addition comprises 1-500 
amino acid residues. 

9. The polypeptide according to claim 7 or 8, wherein the peptide addition comprises 1-20, 
in particular 1-10 glycosylation sites. 

10. The polypeptide according to any of claims 1-9, wherein the glycosylation site is an in 
vivo glycosylation site, preferably an N-glycosylation site. 

11. The polypeptide according to claim 10, wherein the peptide addition comprises a peptide
sequence selected from the group consisting of INAT/S, GNIT/S, VNTT/S, SNIT/S, ASNIT/S,
NTT/S, SPINAT/S, ASPDSTAT/S, ANIT/SANIT/SANI, ANIT/SGSNH/SGSNIT/S,
ASNST/SNNGT/SLNAT/S, ANHT/SNET/SNAT/S, GSPINAT/S, ASPINAT/SSPINAT/S,
ANNT/SNYT/SNWT/S, ATNIT/SLNYT/S ANT/ST, AANST/SGNTT/SINGT/S,
AVNWT/SSNDT/SSNST/S, GNAT/S, AVNWT/SSNDT/SSNST/S, ANNT/SNYT/SNST/S, and
ANNTNYTNWT, wherein T/S is either a T or an S residue, preferably a T residue.

12. The polypeptide according to claim 10 or 11, wherein the peptide addition has an N
residue in position -2 or -1, and the lysosomal enzyme or activator has a T or an S residue in
position +1 or +2, respectively, the residue numbering being made relative to the N-terminal amino
acid residue of the lysosomal enzyme or activator.

13. A chimeric polypeptide comprising a lysosomal enzyme unit linked to at least one unit 
of an activator for said enzyme. 

14. The polypeptide according to claim 13, wherein the enzyme unit and the activator unit(s) 
are linked by a peptide bond or peptide linker. 

15. A chimeric polypeptide comprising a lysosomal enzyme unit linked to at least one 
targeting polypeptide unit, the targeting polypeptide being capable of targeting phagocytic cells. 

16. The polypeptide according to any of claims 1-15, wherein the lysosomal enzyme or 
activator is one that binds to a mannose receptor. 

17. The polypeptide according to any of claims 1-16, wherein the lysosomal enzyme is
selected from the group consisting of glucocerebrosidase (GCB), α-L-iduronidase, acid
α-glucosidase, α-galactosidase, acid sphingomyelinase, galactocerebrosidase, arylsulphatase A,
sialidase, and hexosaminidase.

18. The polypeptide according to any of claims 1-17, wherein the activator is Saposin A,
Saposin B, Saposin C, Saposin D, or GM-2 activator.

19. The polypeptide according to any of claims 1-18, wherein the lysosomal enzyme is a 
glucocerebrosidase (GCB) polypeptide. 

20. The polypeptide according to claim 19, wherein the glycosylation site is an
N-glycosylation site and the polypeptide comprises one or more substitutions, relative to the amino acid
sequence shown in SEQ ID NO:1, selected from the group consisting of K7N+F9T, K7N+*9T,
K7N+*9S, K7N+F9S, K74N+Q76T, K74N+Q76S, K77N+K79T, K77N+K79S, K79N+F81T,
K79N+F81S, K106N+Y108T, K106N+Y108S, K155N+K157T, K155N+K157S, K157N+P159T,
K157N+P159S, K186N+N188T, K186N+N188S, K193N+S195T, K194N, K194T, K198N+Q200T,
K198N+Q200S, K215N+L217T, K215N+L217S, E222N+K224T, K224N+Q226T, K224N+Q226S,
K293N+L295T, K293N+L295S, K303N+V305T, K303N+V305S, K321N, K321N+T323S,
K346N+W348T, K346N+W348S, K408N, K408N+T410S, K413N+P415T, K413N+P415S,
K425N+I427T, K425N+I427S, K441N+D443T, K441N+D443S, K466N+V468T, K466N+V468S,
K473N+P475T and K473N+P475S.

21. A polypeptide according to claim 19, wherein the glycosylation site is an
N-glycosylation site and one or more amino acid residue of the parent GCB polypeptide selected from
the group consisting of P6, G10, Y11, C23, T36, Y40, T43, E50, A95, L105, Y108, M133, D137,
P171, L175, W179, K194, L240, A269, E235, F337, V343, E349, L354, Q362, S364, V398, H422,
E429, V437, D453, R463, T482, G486, P28, 134, E41, T61, L66, A84, I130, T132, A136, S181,
E152, P178, L185, H206, G255, A291, G250, V295, K321, G325, P332, I367, G377, D405, K408,
P465, L480 and I489 (of the amino acid sequence shown in SEQ ID NO:1) is/are substituted with an
asparagine residue.

22. The polypeptide according to claim 19, wherein the glycosylation site is an in vitro
glycosylation site, e.g. selected from the group consisting of the N-terminal amino acid residue of
the polypeptide, the C-terminal residue of the polypeptide, lysine, cysteine, arginine, glutamine,
aspartic acid, glutamic acid, serine, tyrosine, histidine, phenylalanine and tryptophan.

23. The polypeptide according to claim 22, wherein the in vitro glycosylation site is a lysine
residue.

24. The polypeptide according to claim 23, wherein one or more of the amino acid residues
of wtGCB (SEQ ID NO 1) selected from the group consisting of R2, R39, R44, R47, R48, R120,
R131, R163, R170, R211, R257, R262, R277, R285, R339, R353, R359, R395, R433, R463, R495,
R496, H60, H145, H162, H206, H223, H255, H273, H274, H290, H306, H311, H328, H365, H374,
H419, H422, H451, H490, D24, D27, D87, D127, D137, D140, D141, D153, D203, D218, D258,
D263, D282, D283, D298, D358, D380, D399, D405, D409, D443, D445, D453, D467, D474, E41,
E50, E72, E111, E112, E151, E152, E222, E233, E235, E254, E300, E326, E340, E349, E388,
E429, and E481 has/have been replaced with a lysine residue.

25. The polypeptide according to any of claims 22-24, further lacking an in vitro 
glycosylation site present in wtGCB. 

26. The polypeptide according to claim 25, wherein a lysine residue present in wtGCB is
substituted with another amino acid residue, preferably arginine, or deleted from one or more
positions selected from the group consisting of K7, K74, K77, K79, K106, K155, K157, K186,
K193, K197, K215, K224, K293, K303, K321, K346, K408, K413, K425, K441, K466 and K473 of
the amino acid sequence shown in SEQ ID NO:1.

27. A GCB polypeptide comprising a modification at any of amino acid residues 132-139 
relative to SEQ ID NO 1, resulting in reduced susceptibility to proteolytic degradation. 

28. The GCB polypeptide according to claim 27, wherein a glycosylation site is introduced 
into any of positions 132-139. 

29. The GCB polypeptide according to claim 27 or 28, comprising the mutation A136N,
A135P or A136P.

30. A polypeptide comprising at least one unit of a polypeptide targeting phagocytic cells,
preferably macrophages or macrophage like cells, and a GCB polypeptide unit.

31. A polypeptide comprising a GCB polypeptide unit and at least one Saposin C
polypeptide and/or a Saposin A polypeptide unit.

32. The polypeptide according to claim 30 or 31, wherein the different polypeptide
constituents are linked with a peptide bond or a peptide linker.

33. The polypeptide according to any of claims 30-32, wherein the GCB polypeptide is a
polypeptide according to any of claims 19-29 or a wtGCB with an amino acid sequence included in
SEQ ID NO:1.

34. The polypeptide according to any of claims 1-33, which is glycosylated. 

35. The glycosylated polypeptide according to claim 34, comprising at least one 
oligosaccharide chain comprising an exposed mannose residue. 

36. The polypeptide according to claim 34 or 35, which has a glycosylation profile 
characteristic of that provided by expression in an invertebrate cell. 

37. The polypeptide according to any of claims 34-36, which has the glycosylation profile
characteristic of that provided by expression in a yeast, insect, or plant cell.

38. The polypeptide according to claim 37, wherein the insect cell is a Lepidoptera cell line.

39. The polypeptide according to any of claims 34-38, wherein at least one oligosaccharide
chain has the structure

Asn-N-N-M-M2

wherein Asn indicates the Asn residue of the polypeptide to which the oligosaccharide chain is
attached, N an N-acetylglucosamine residue, and M-M2 three mannose residues two of which are
linked to the same mannose.

40. The polypeptide according to any of claims 34-36, which is expressed from a
mammalian cell line and subsequently modified by sequential treatment with neuraminidase,
galactosidase and β-N-acetylglucosaminidase so as to obtain at least one exposed mannose residue.

41. The polypeptide according to any of claims 34-40, comprising comprising 1-10 
oligosaccharide moieties. 

42. The polypeptide according to any of claims 36-41, which is expressed from a cell 
producing a mcose'-containing oligosaccharide structure, and wherein said polypeptide subsequent 
to expression is treated with a fucosidase. 

43. The polypeptide according to any of claims 1-42, which has at least one of the following
properties: 

increased affinity for a mannose receptor or other carbohydrate receptor,

increased serum half-life,

increased functional in vivo half-life, 

increased in vivo bioactivity, 

reduced immunogenicity, 

increased resistance to proteolytic cleavage, and/or 
increased targeting to or uptake in phagocytic cells or a suborganel compartment thereof.

44. The polypeptide according to any of claims 19-43, which exhibits increased in vivo 
activity relative to a wildtype GCB (wtGCB). 

45. A nucleotide sequence encoding a polypeptide according to any of claims 1-44. 

46. An expression vector comprising a nucleotide sequence according to claim 45. 

47. A host cell transformed or transfected with a nucleotide sequence according to claim 45, 
or an expression vector according to claim 46. 

48. The host cell according to claim 47, which is an invertebrate cell such as an insect cell, a 
yeast cell or a plant cell, or a mammalian cell, in particular a glycosylation mutant thereof. 

49. The cell line according to claim 48, wherein the GCB polypeptide is a wtGCB or a 
variant or truncated form thereof or a GCB polypeptide according to any of claims 19-44. 

50. A CHO Lec1 cell line comprising a heterologous nucleotide sequence encoding a
lysosomal enzyme or a lysosomal enzyme activator. 

51. A method of producing a polypeptide according to any of claims 1-44, comprising
culturing the host cell according to any of claims 47-50 under conditions permitting the expression 
of the polypeptide and recovering the polypeptide from the culture. 

52. The method according to claim 51, further comprising subjecting the optionally
glycosylated polypeptide to in vitro glycosylation. 

53. A method of improving at least one property of a lysosomal enzyme, which method 
comprises introducing an additional glycosylation site into the lysosomal enzyme to be improved, 
and producing the modified lysosomal enzyme under conditions ensuring that the enzyme is 
glycosylated. 

54. The method according to claim 53, wherein the lysosomal enzyme is a GCB 
polypeptide. 

55. The method according to claim 53 or 54, wherein the improved property is any of those 
mentioned in claim 43. 

56. A pharmaceutical composition comprising a polypeptide according to any of claims 1- 
44 and a pharmaceutically acceptable diluent, carrier or excipient.

57. The use of a polypeptide according to any of claims 1-44 or a pharmaceutical 
composition according to claim 56 for treatment or prevention of diseases. 

58. The use according to claim 57, wherein the polypeptide is a GCB polypeptide or 
Saposin C polypeptide or a chimeric polypeptide thereof, for treatment or prevention of Gaucher's
disease. 

59. The use of a polypeptide according to any of claims 1-44 or a pharmaceutical 
composition according to claim 56, wherein the polypeptide is a GCB polypeptide or Saposin C 
polypeptide or a chimeric polypeptide thereof, for the manufacture of a medicament for treatment or
prevention of Gaucher's disease.

60. A method of treating Gaucher's disease, in which an effective amount of a GCB 
polypeptide according to any of claims 19-44, a Saposin C polypeptide or a chimeric polypeptide 
thereof is administered to a patient in need thereof. 

61. The use of a nucleotide sequence according to claim 45 in gene therapy, the nucleotide 
sequence encoding a lysosomal enzyme or activator thereof with at least one introduced in vivo 
glycosylation site as compared to a parent, naturally-occurring enzyme or activator. 
SEQUENCES



SEQ ID NO: 1 

Amino acid sequence of wtGCB - mature sequence

  1 ARPCIPKSFG YSSVVCVCNA TYCDSFDPPT FPALGTFSRY ESTRSGRRME LSMGPIQANH
 61 TGTGLLLTLQ PEQKFQKVKG FGGAMTDAAA LNILALSPPA QNLLLKSYFS EEGIGYNIIR
121 VPMASCDFSI RTYTYADTPD DFQLHNFSLP EEDTKLKIPL IHRALQLAQR PVSLLASPWT
181 SPTWLKTNGA VNGKGSLKGQ PGDIYHQTWA RYFVKFLDAY AEHKLQFWAV TAENEPSAGL
241 LSGYPFQCLG FTPEHQRDFI ARDLGPTLAN STHHNVRLLM LDDQRLLLPH WAKVVLTDPE
301 AAKYVHGIAV HWYLDFLAPA KATLGETHRL FPNTMLFASE ACVGSKFWEQ SVRLGSWDRG
361 MQYSHSIITN LLYHVVGWTD WNLALNPEGG PNWVRNFVDS PIIVDITKDT FYKQPMFYHL
421 GHFSKFIPEG SQRVGLVASQ KNDLDAVALM HPDGSAVVVV LNRSSKDVPL TIKDPAVGFL
481 ETISPGYSIH TYLWXRQ, wherein X is R or H.
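
The claims above refer to introducing additional glycosylation sites into a lysosomal enzyme. As a rough illustration only, the following Python sketch scans a sequence for N-glycosylation sequons, taken here as the conventional N-X-S/T motif with X not proline. That motif definition, the function names `find_sequons` and `substitute`, and the example substitution at position 6 are assumptions made for illustration and are not taken from the specification. The input is the first 70 residues of SEQ ID NO: 1, with positions 14-15 read as VV in line with the codons GTGGTG in SEQ ID NO 2.

```python
import re

# First 70 residues of the mature wtGCB sequence (SEQ ID NO: 1), used as toy input.
# Assumption: positions 14-15 are read as "VV", matching codons GTGGTG in SEQ ID NO 2.
FRAGMENT = (
    "ARPCIPKSFGYSSVVCVCNATYCDSFDPPTFPALGTFSRY"
    "ESTRSGRRMELSMGPIQANHTGTGLLLTLQ"
)

# Assumed sequon definition: Asn, any residue except Pro, then Ser or Thr.
# The lookahead lets overlapping motifs be reported as well.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(seq):
    """Return 1-based positions of the Asn residue in each N-X-S/T motif."""
    return [m.start() + 1 for m in SEQUON.finditer(seq)]


def substitute(seq, position, new_residue):
    """Return a copy of seq with the residue at a 1-based position replaced."""
    i = position - 1
    return seq[:i] + new_residue + seq[i + 1:]


if __name__ == "__main__":
    print(find_sequons(FRAGMENT))  # [19, 59]
    # Hypothetical substitution (not from the specification): Pro6 -> Asn
    # creates the motif N6-K7-S8 in this fragment.
    print(find_sequons(substitute(FRAGMENT, 6, "N")))  # [6, 19, 59]
```

A motif scan of this kind only flags candidate positions; whether a site is actually glycosylated depends on the expression host and on the local structure of the polypeptide.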



SEQ ID NO 2 

DNA sequence of the wt glucocerebrosidase used for expression in insect cells,
including the signal sequence:

ATGGCTGGCAGCCTCACAGGATTGCTTCTACTTCAGGCAGTGTCGTGGGCATCAGGTGCCCGCCCCTGCATCCCT 
AAAAGCTTCGGCTACAGCTCGGTGGTGTGTGTCTGCAATGCCACATACTGTGACTCCTTTGACCCCCCGACCTTT 
CCTGCCCTTGGTACCTTCAGCCGCTATGAGAGTACACGCAGTGGGCGACGGATGGAGCTGAGTATGGGGCCCATC 
CAGGCTAATCACACGGGCACAGGCCTGCTACTGACCCTGCAGCCAGAACAGAAGTTCCAGAAAGTGAAGGGATTT 
GGAGGGGCCATGACAGATGCTGCTGCTCTCAACATCCTTGCCCTGTCACCCCCTGCCCAAAATTTGCTACTTAAA 
TCGTACTTCTCTGAAGAAGGAATCGGATATAACATCATCCGGGTACCCATGGCCAGCTGTGACTTCTCCATCCGC 
ACCTACACCTATGCAGACACCCCTGATGATTTCCAGTTGCACAACTTCAGCCTCCCAGAGGAAGATACCAAGCTC 
AAGATACCCCTGATTCACCGAGCACTGCAGTTGGCCCAGCGTCCCGTTTCACTCCTTGCCAGCCCCTGGACATCA
CCCACTTGGCTCAAGACCAATGGAGCGGTGAATCGGAAGGGGTCACTCAAGGGACAGCCCGGAGACATCTACCAC 
CAGACCTGGGCCAGATACTTTGTGAAGTTCCTGGATGCCTATGCTGAGCACAAGTTACAGTTCTGGGCAGTGACA
GCTGAAAATGAGCCTTCTGCTGGGCTGTTGAGTGGATACCCCTTCCAGTGCCTGGGCTTCACCCCTGAACATCAG 
CGAGACTTAATTGCCCGTGACCTAGGTCCTACCCTCGCCAACAGTACTCACCACAATGTCCGCCTACTCATGCTG 
GATGACCAACGCTTGCTGCTGCCCCACTGGGCAAAGGTGGTGCTGACAGACCCAGAAGCAGCTAAATATGTTCAT 
GGCATTGCTGTACATTGGTACCTGGACTTTCTGGCTCCAGCCAAAGCCACCCTAGGGGAGACACACCGCCTGTTC 
CCCAACACCATGCTCTTTGCCTCAGAGGCCTGTGTGGGCTCCAAGTTCTGGGAGCAGAGTGTGCGGCTAGGCTCC 
TGGGATCGAGGGATGCAGTACAGCCACAGCATCATCACGAACCTCCTGTACCATGTGGTCGGCTGGACCGACTGG 
AACCTTGCCCTGAACCCCGAAGGAGGACCCAATTGGGTGCGTAACTTTGTCGACAGTCCCATCATTGTAGACATC 
ACCAAGGACACGTTTTACAAACAGCCCATGTTCTACCACCTTGGCCATTTCAGCAAGTTCATTCCTGAGGGCTCC 
CAGAGAGTGGGGCTGGTTGCCAGTCAGAAGAACGACCTGGACGCAGTGGCATTGATGCATCCCGATGGCTCTGCT 
GTTGTGGTCGTGCTAAACCGCTCCTCTAAGGATGTGCCTCTTACCATCAAGGATCCTGCTGTGGGCTTCCTGGAG 
ACAATCTCACCTGGCTACTCCATTCACACCTACCTGTGGCGTCGCCAGTGA 
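
To show how the DNA listing relates to the amino acid sequence of SEQ ID NO: 1, the following Python sketch translates the first 75 nucleotides of SEQ ID NO 2 using the standard genetic code. The function name `translate` and the handling of unreadable codons (reported as `X`) are assumptions made for illustration. Under these assumptions, the translated fragment ends in ARPCIP, the first six residues of the mature sequence, so the preceding residues would correspond to the signal sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Codons containing characters other than A, C, G or T are reported as 'X'.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line (75 nucleotides) of SEQ ID NO 2 as printed in the listing.
FIRST_LINE = (
    "ATGGCTGGCAGCCTCACAGGATTGCTTCTACTTCAGGCAGTGTCGTGG"
    "GCATCAGGTGCCCGCCCCTGCATCCCT"
)

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MAGSLTGLLLLQAVSWASGARPCIP
```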

SEQ ID NO 3 Saposin C wt amino acid sequence 

SDVYCEVCEFLVKEVTKLIDNNKTEKEILDAFDKMCSKLPKSLSEECQEVVDTYGSSILSILLEEVSPELVCSML
HLCSG



SEQ ID NO 4: Chimeric SapC-linker-GCB polypeptide 

HLCSGGGGGSGGGGSGGGGSARPCIPKSFGYSSVVCVCNATYCDSFDPPTFPALGTFSRYESTRSGRRMELSMGP
IQANHTGTGLLLTLQPEQKFQKVKGFGGAMTDAAALNILALSPPAQNLLLKSYFSEEGIGYNIIRVPMASCDFSI
RTYTYADTPDDFQLHNFSLPEEDTKLKIPLIHRALQLAQRPVSLLASPWTSPTWLKTNGAVNGKGSLKGQPGDIY
HQTWARYFVKFLDAYAEHKLQFWAVTAENEPSAGLLSGYPFQCLGFTPEHQRDFIARDLGPTLANSTHHNVRLLM
LDDQRLLLPHWAKVVLTDPEAAKYVHGIAVHWYLDFLAPAKATLGETHRLFPNTMLFASEACVGSKFWEQSVRLG
SWDRGMQYSHSIITNLLYHVVGWTDWNLALNPEGGPNWVRNFVDSPIIVDITKDTFYKQPMFYHLGHFSKFIPEG
SQRVGLVASQKNDLDAVALMHPDGSAVVVVLNRSSKDVPLTIKDPAVGFLETISPGYSIHTYLWRRQ
(SEQ ID NO 4) 